Android Auto: hands-free operation via voice commands.
Word Lens: “augmented reality translation”
“Her” movie in reality: “Siri will you marry me? Siri doesn’t seem to be available.”
Segmentation complexities.
Disambiguation complexities. E.g. Flying planes can be dangerous.
What can be extracted: categories (taxonomy terms), entities (e.g. biological/chemical), sentiment.
NLTK - Python toolkit for NLP.
How to get to the core words? Remove stopwords.
from nltk.corpus import stopwords
from nltk.tokenize import word_tokenize
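A minimal sketch of stopword removal, written in plain Python so it runs without downloading NLTK data. The tiny `STOPWORDS` set and the regex `tokenize` helper are illustrative stand-ins (assumptions, not from the talk) for what `stopwords.words("english")` and `word_tokenize` would provide.

```python
import re

# Tiny illustrative stoplist (an assumption); NLTK's English list is much longer.
STOPWORDS = {"a", "an", "and", "are", "be", "can", "in", "is", "of", "the", "to"}


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def core_words(text, stopwords=STOPWORDS):
    """Return the tokens of `text` that are not stopwords, in original order."""
    return [tok for tok in tokenize(text) if tok not in stopwords]


print(core_words("Flying planes can be dangerous"))
# ['flying', 'planes', 'dangerous']
```

With NLTK installed and its data downloaded, the same filter would presumably use the real stopword list and tokenizer in place of the two stand-ins.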
Keyword scoring: TF×IDF (term frequency × inverse document frequency).
Gensim:
from gensim import corpora, models
When ranking words, the score can be used to discard the low-scoring ones.
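To make the scoring and discarding step concrete, here is a hand-rolled sketch that weights each word as tf × log(N / df) and drops words at or below a threshold. It is not Gensim's implementation: Gensim's model may use different weighting and normalisation defaults, and the function names, sample documents and threshold are assumptions for illustration.

```python
import math
from collections import Counter


def tfidf_scores(docs):
    """Score each word in each tokenised document as tf * log(N / df)."""
    n_docs = len(docs)
    # Document frequency: in how many documents each word occurs.
    df = Counter(word for doc in docs for word in set(doc))
    scored = []
    for doc in docs:
        tf = Counter(doc)
        scored.append(
            {w: (c / len(doc)) * math.log(n_docs / df[w]) for w, c in tf.items()}
        )
    return scored


def keywords(scores, threshold):
    """Keep words scoring above the threshold, highest score first."""
    kept = [(w, s) for w, s in scores.items() if s > threshold]
    return sorted(kept, key=lambda pair: (-pair[1], pair[0]))


docs = [
    ["python", "toolkit", "language", "text"],
    ["python", "parser", "text", "text"],
    ["python", "sentiment", "text", "score"],
]
scores = tfidf_scores(docs)
# "python" and "text" occur in every document, so they score 0 and are discarded.
print([word for word, _ in keywords(scores[0], threshold=0.0)])
# ['language', 'toolkit']
```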
Text Categorisation with NLTK
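One possible way to do this is NLTK's `NaiveBayesClassifier` over bag-of-words features. The sketch below is an assumption about the approach, not necessarily what the talk showed. The four training sentences, the labels and the `bag_of_words` helper are made up for illustration, and plain `split()` is used instead of `word_tokenize` so no NLTK data download is needed.

```python
import nltk


def bag_of_words(text):
    """Turn a sentence into the {word: True} feature dict NLTK classifiers expect."""
    return {word: True for word in text.lower().split()}


# Toy labelled data (hypothetical); a real classifier needs far more examples.
train = [
    ("the team won the match", "sports"),
    ("a great goal in the final game", "sports"),
    ("the new phone has a fast processor", "tech"),
    ("software update fixes the bug", "tech"),
]
train_set = [(bag_of_words(text), label) for text, label in train]
classifier = nltk.NaiveBayesClassifier.train(train_set)

print(classifier.classify(bag_of_words("our team scored a goal")))
print(classifier.classify(bag_of_words("the phone needs a software update")))
```

Words never seen in training (such as "our" or "needs") are ignored by the classifier, so the decision rests on the words shared with the training sentences.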
Sentiment Analysis:
from textblob import TextBlob
http://deeplearning.net/software/theano
http://textblob.readthedocs.org
http://scikit-learn.org/stable
http://nltk.org
http://radimrehurek.com/gensim
http://github.com/zelandiya/KiwiPyCon-NLP-tutorial