Conversation
|
sounds like the idea is to do merging of excel files with pandas and do some plots with matplotlib. You may want to look at seaborn. |
|
Just keep pushing to GitHub as you are doing. Your setup looks good. You now just have to add some modules containing functions in the nobel_winners sub-directory for some easy points. Have a look at the rubric too. Let us know if you have any pandas questions too if you need help working through the csv processing. |
|
Hi, John. I have a question: in the "born" column of my DataFrame "Nobel", there are birth date in formats of "06/15/1984", how can I extract only the year "1984" from that column? Thank you. |
|
Have a look at [this](https://jakevdp.github.io/PythonDataScienceHandbook/03.11-working-with-time-series.html#Pandas-Time-Series:-Indexing-by-Time).
A quick glance makes me think the following but let me know if you have
problems.
```
nobel['born'] = pd.DatetimeIndex(nobel['born'])
nobel['born'].year
```
|
|
Hi Xing, Well done on the project. Pandas is a great tool to use when working with datasets and you use it to organize and plot your data nicely. The Jupyter notebook is also a nice addition to be able to visualize your plots. In the future, I think adding the .ipynb_checkpoints into your .gitignore file is a good idea. Also, it's a good idea to use the README.md to give a brief description of your code and its purpose; it will make it easier for others to understand and also for you to come back to your code and refresh your memory if you don't look at it for a while. Overall, nicely done! Cheers, Emily |
No description provided.