Lesson 38: Bootcamp recap

(c) 2018 Justin Bois. With the exception of pasted graphics, where the source is noted, this work is licensed under a Creative Commons Attribution License CC-BY 4.0. All code contained herein is licensed under an MIT license.

This document was prepared at Caltech with financial support from the Donna and Benjamin M. Rosen Bioengineering Center.

This lesson was generated from a Jupyter notebook. You can download the notebook here.



Hello, world

In [1]:
print('hello, world')
hello, world

Command line mastery

"Ugh. I'm just going to do it on the command line." --Overheard at bootcamp

Scripting

You now have a very efficient assistant.

Version control and TDD

You can now sleep easy

...and collaborate

...and do better, reproducible, sharable science

...and the bootcamp has code on the internet for all!

Data types

  • ints
  • floats
  • strings
  • bools
  • lists
  • tuples
  • dictionaries
  • sets
  • functions, file handles, NumPy arrays, Pandas DataFrames, ...

NumPy arrays

Slicing and dicing

Random number generation (probability is the mathematical language of biology)

Anything you want to do with groups of numbers

Altair

Your plotting is

automatic

grammatical

reproducible

beautiful.

Pandas

Your data are organized.

And tidy.

And accessible.

As a hacker

Statistics are tractable and understandable.

You can simulate complex processes.

Your data

have a new master. Whether images, time series, counts, you name it.

You will never forget

Explicit is better than implicit.

The most important person in your life is future you.

Premature optimization is the root of all evil.

Plot all of your data.

Readability matters.

This is the beginning!

I look forward to seeing what you will do!