Understanding human language with Python - Alyona Medelyan

Android Auto: hands-free operation via voice commands.

Word Lens: “augmented reality translation”

“Her” movie in reality: “Siri will you marry me? Siri doesn’t seem to be available.”

Segmentation complexities.

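Disambiguation complexities. E.g. “Flying planes can be dangerous” can mean either that the act of flying planes is dangerous, or that planes which are flying are dangerous.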

Categories - taxonomy terms, entities - biological/chemical, sentiment.

NLTK - Python toolkit for NLP.

How to get to the core words? Remove stopwords.

from nltk.corpus import stopwords
from nltk.tokenize import word_tokenize
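
The idea behind stopword removal can be sketched in plain Python. This is only an illustration, not the code from the talk: the `STOPWORDS` set and the `core_words` helper are hypothetical stand-ins, and the regex tokenizer is much cruder than NLTK's `word_tokenize`. With NLTK and its corpora downloaded, `stopwords.words("english")` would supply a fuller list.

```python
import re

# Hypothetical mini stopword list for illustration only;
# NLTK's stopwords.words("english") is far more complete.
STOPWORDS = {"the", "a", "an", "is", "of", "to", "and", "in", "on", "can", "be"}


def core_words(text):
    """Lowercase the text, split it into word tokens, and drop stopwords."""
    tokens = re.findall(r"[a-z']+", text.lower())
    return [t for t in tokens if t not in STOPWORDS]


print(core_words("Flying planes can be dangerous."))
```

The remaining tokens are the candidates for keyword scoring.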

Keyword scoring: TFxIDF
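
As a rough sketch of what TFxIDF computes, the snippet below scores each word by its relative frequency in a document multiplied by log(N / document frequency). The `tfidf` function and the toy documents are assumptions made for illustration; Gensim's `TfidfModel` uses its own weighting and normalisation defaults, so its numbers would differ.

```python
import math
from collections import Counter


def tfidf(docs):
    """docs: list of token lists. Returns one {word: score} dict per document."""
    n = len(docs)
    # Document frequency: in how many documents does each word appear?
    df = Counter()
    for doc in docs:
        df.update(set(doc))
    scores = []
    for doc in docs:
        tf = Counter(doc)
        scores.append(
            {w: (c / len(doc)) * math.log(n / df[w]) for w, c in tf.items()}
        )
    return scores


# Toy corpus (hypothetical data).
docs = [["python", "nlp", "python"], ["python", "web"]]
scores = tfidf(docs)
print(scores[0])
```

A word that occurs in every document, such as "python" here, gets a score of zero, while words specific to one document score higher.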

Gensim:

from gensim import corpora, models

When ranking words, the score can be used to discard the low-scoring ones.
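
One possible way to do this, assuming the scores are already available as a word-to-score mapping: sort by score and keep only words above a cutoff. The `keep_keywords` helper, the `min_score` default, and the example scores are all hypothetical.

```python
def keep_keywords(scores, min_score=0.1):
    """Rank words by score (highest first) and drop those below min_score."""
    ranked = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
    return [word for word, score in ranked if score >= min_score]


# Made-up scores for illustration.
example_scores = {"nlp": 0.23, "python": 0.0, "toolkit": 0.15}
print(keep_keywords(example_scores))
```

The right threshold depends on the corpus and the scoring scheme, so it would need tuning in practice.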

Text Categorisation with NLTK
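
The notes do not record which classifier was shown. As an assumption, the sketch below uses a tiny naive Bayes classifier with add-one smoothing written in plain Python, with hypothetical `train` and `classify` helpers and made-up training data. In NLTK, `nltk.NaiveBayesClassifier.train` plays a similar role and takes a list of (feature dict, label) pairs.

```python
import math
from collections import Counter, defaultdict


def train(examples):
    """examples: list of (tokens, label) pairs. Returns a simple count-based model."""
    label_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for tokens, label in examples:
        label_counts[label] += 1
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return label_counts, word_counts, vocab


def classify(model, tokens):
    """Return the label with the highest log-probability under naive Bayes."""
    label_counts, word_counts, vocab = model
    total = sum(label_counts.values())
    best_label, best_score = None, float("-inf")
    for label, count in label_counts.items():
        score = math.log(count / total)  # log prior
        denom = sum(word_counts[label].values()) + len(vocab)
        for t in tokens:
            # Add-one (Laplace) smoothing so unseen words do not zero the score.
            score += math.log((word_counts[label][t] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Made-up training data for illustration.
examples = [
    ("goal match team".split(), "sport"),
    ("election vote party".split(), "politics"),
    ("team win match".split(), "sport"),
    ("party leader vote".split(), "politics"),
]
model = train(examples)
print(classify(model, "match team".split()))
```

Real categorisation would need far more training data and usually some feature selection, for example the stopword removal and keyword scoring described earlier.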

Sentiment Analysis:

from textblob import TextBlob
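
With TextBlob installed, `TextBlob(text).sentiment` returns a polarity and a subjectivity value. To show the underlying idea without that dependency, here is a minimal lexicon-based sketch; the `LEXICON` entries and the `polarity` helper are hypothetical and far simpler than what TextBlob actually does.

```python
import re

# Hypothetical mini sentiment lexicon: word -> polarity in [-1, 1].
LEXICON = {
    "good": 1.0, "great": 1.0, "love": 1.0,
    "bad": -1.0, "terrible": -1.0, "hate": -1.0,
}


def polarity(text):
    """Average the polarity of known words; 0.0 if none are found."""
    tokens = re.findall(r"[a-z']+", text.lower())
    hits = [LEXICON[t] for t in tokens if t in LEXICON]
    return sum(hits) / len(hits) if hits else 0.0


print(polarity("I love this great toolkit"))
print(polarity("What a terrible talk"))
```

A lexicon lookup like this ignores negation and context ("not good" still counts as positive), which is one reason to prefer a library or a trained classifier.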

http://deeplearning.net/software/theano

http://textblob.readthedocs.org

http://scikit-learn.org/stable

http://nltk.org

http://radimrehurek.com/gensim

http://github.com/zelandiya/KiwiPyCon-NLP-tutorial