Skip to main content


One of the (many) good sides of teaching: you get plenty of time to delve into topics and develop your own material (and code).

I now want to share part of my materials as a #DataScience textbook online:
https://florian-huber.github.io/data_science_course/book/cover.html
(work in progress)

All materials and source code is on #github: https://github.com/florian-huber/data_science_course

And a pdf version can be found on #zenodo: https://zenodo.org/records/10074475

#opensource #Openscience

in reply to Florian Huber

wonderful resource - thanks for sharing! I had long toyed with the idea of doing something similar for bioinformatics...hopefully one day I will get to it.
in reply to Florian Huber

this is great! But I disagree with teaching newbies with Anaconda (or variants) since it creates massive confusion on which Python is the right one and often leads to issues with installing new code via pip
in reply to Charles Tapley Hoyt

@cthoyt what is a good alternative? Conda is nice for new students because they can get tools installed fairly quickly.
in reply to Frank Aylward

@foaylward installation of Python from the first-party installer works well for Windows and Mac.

I know that veteran programmers love environments, but this is just another confusing abstraction that students have to understand. Better to say "hey, if you want pandas, do `pip install pandas`" and it will just be there. I don't think the 3 conda steps starting at https://florian-huber.github.io/data_science_course/book/03_use_of_this_book.html#step-3-create-a-new-environment are going to make any student say "I love python"

in reply to Charles Tapley Hoyt

@cthoyt oh I see - yes I can see how that would work too. I teach a bioinformatics class on an HPC running Linux and so I've found conda is simplest. But I do have a full week of logging in/ conda initialization in the beginning to get everyone started.

Lo, thar be cookies on this site to keep track of your login. By clicking 'okay', you are CONSENTING to this.