Python for Data Analysis and Natural Language Processing
As I’m making my way through Natural Language Processing with Python and Data Science from Scratch: First Principles with Python, the first step is to set up the development environment.
My first attempt was to install numpy, python, nltk, matplotlib, IPython, etc, one at a time. However, I hit a few clashes between Python versions, so switched to Anaconda instead:
- Download Anaconda
- From the download folder, execute
- sh <name of downloaded file>
- Accept the defaults, but answer yes to preprend to PATH
- To check it works, start Python
- import numpy
- import pandas
- import matplotlib
- Download the nltk assets. Start Python and enter:
- import nltk
- nltk.download()
- use GUI to download “all”