Best way to learn data science?

RocketSurgeon85 · 17 時間前

While my official title is not "Data Scientist" (I'm a post doc at a US DOE national lab), about 75% of my day-to-day involves what I would consider data science using numpy,scipy,scikit-image, some pandas, matplotlib, etc...

I would suggest finding something you are interested in and doing some "data science" on it. My personal opinion (which is worth what you have paid for it) is that it is best to learn by doing, rather than just reading. The reading and courses will help, but that is only a tiny fraction of it. Things you may be able to do:

Analyze stock tick data
Find out some information about sports players and their statistics
Look at currency market data (there is a lot of historical data for bitcoin readily available for various exchanges)
Analyze ebook data (for common words, sentence length, ...)
Analyze twitter feeds/trends (similar stuff to ebooks, and you can throw in some info about geospatial location)
Look at price data of a product/s as a function of time on something like amazon or newegg (you can learn some simple url scraping with this too)
Learn something about your local region with weather data.

I'm sure there are more options that others can think of too.

Good Luck!

IOvOI_owl · 17 時間前

http://datasciencemasters.org/ seems to be a good collection of resources and books.

jstutters · 16 時間前

You don't say much about what your background is. Do you already know some Python, stats and a bit about scientific method? If not, you might be better off doing more focussed courses on those first. If you've got the background then I would complete (including the exercises and projects) any of the courses you listed above and then grab something easy off Kaggle or come up with your own managably small project and see it through to completion.

TLDR- learn something, figure out what you still don't know, iterate

fnord123 · 18 時間前

For many people, the best way to learn is to do. So find a project and get to work. Enter a competition on kaggle. Or scrape reviews from a website and write a sentiment analyser. Pull down financial data off Yahoo and use it to determine a trading portfolio where you only change positions each week (or month). Write a spam filter.

rishsriv · 12 時間前

Harvard's CS109 is one of the most comprehensive data science courses out there, IMO. It's rigorous, but you'll learn a lot from it if you follow it till the very end.

StringyLow · 9 時間前

I found this a couple weeks ago:

How to become a data scientist

westurner · 14 時間前

https://wrdrd.github.io/docs/consulting/data-science.html

shaggorama · 12 時間前

I mean, the absolute best course of action would be to get a masters in CS, math or statistics with a focus on applied methods and machine learning.

The coursera machine learning course is really good and is what got me started down the path. I took the innaugural course a few years ago, studied a bunch on my own, went to grad school, and now I'm working as a data scientist. It can be done.

Notre1 · 11 時間前

For some more ideas, check out the curriculum linked below and some of the comments on it from Hacker News. I don't have anything to do with it, but I just remembered it looked really well put together when I saw it a few months ago.

https://www.mysliderule.com/learning-paths/data-analysis/learn/
- https://news.ycombinator.com/item?id=7815906

toodim · 12 時間前

If you are into MOOCs Coursera's Machine learning with Andrew Ng and edX's Machine Learning course from Caltech (just ended...) go into greater depth. Udacity released an intro machine learning course last month that uses scikit-learn; it doesn't go into much mathematical depth but it covers a lot of different topics and uses python.

Prinkster · 11 時間前

Depends on the kind of thing you want to do! If you're more interested in the statistics aspect, one way to start would be to get a copy of an old edition of a textbook (can usually be bought for under $20 used on Amazon, for example) for something like Econometrics (the Wooldridge book is highly recommended) and working through all of the computer examples. This would be a nice way to start if you're interested in modeling and statistics.

pybokeh · 8 時間前

Without knowing your background, basics or foundations first:
Learn statistics, linear/matrix mathematics, and learn all the ancillary skills centered around data analysis life cycle:
- obtaining data
- cleaning or transforming data
- analyzing data
- visualizing data
- presenting data to draw conclusions or drive business decisions

The above was just a tools/skills agnostic point of view.

Now for tools and skills that are used to perform the above:
Obtaining data: often requires SQL and database knowledge
Cleaning or Transforming data via Python or R or Excel, etc
Analyzing data via Python or R or Excel, etc
Visualizing data via Python or R or Excel, etc

For practical uses, others have already provided good suggestions:
- create a simple database using sqlite, then progress to MySQL or Postgres
- web scrape data (maybe use scraped data to populate your database)
- if you're already into raspberry pi or arduino, use data collected from sensors and analyze and chart that too
- check out /r/datasets for data set ideas or /r/pystats
- check out kaggle competitions

Check out blogs or github accounts from prominent people or organizations in the data science fields:
- Rob Stoy's Python visualization stack
- www.gregreda.com
- Simple to follow exploratory data analysis example
- yhat

Hope this helps!

iNeverHaveNames · 6 時間前

You may have already found this and its not really a structured course, but youtuber SentDex does a ton of hands-on videos involving several types of big data analysis.

shashwat986 · 18 時間前

Learn Machine Learning, not Data Science. Data Science is basically an application of ML.

https://www.coursera.org/course/ml

https://www.edx.org/course/learning-data-caltechx-cs1156x#.VIqibTGUcms

ユーザインターフェイス用言語	(*) 未完成翻訳ボランティアに立候補する

Python

調停者

この操作にはログインまたは登録が必要です

新しいアカウントを作る

sign in