Objectives:
1. Explore Amazon reviews
2. Sentimentalize the reviews
3. Word frequency by helpfulness
Workshop Resources
Azure Notebooks Library
– Sentiment Notebook
– Commoners Notebook
More information
Datasets
http://jmcauley.ucsd.edu/data/amazon/ | Amazon reviews for NLP
http://mpqa.cs.pitt.edu/lexicons/effect_lexicon/ | +/- Effect Lexicon
Packages
http://nlp.johnsnowlabs.com/ | Spark Package for NLP
https://spark.apache.org/docs/latest/ml-guide.html | Spark ML guide – focus on DataFrame based, NOT RDD-based
Deprecated: Creation of dynamic property WP_Term::$cat_ID is deprecated in
/home/garrens3/public_html/blog/wp-includes/category.php on line
378
Deprecated: Creation of dynamic property WP_Term::$category_count is deprecated in
/home/garrens3/public_html/blog/wp-includes/category.php on line
379
Deprecated: Creation of dynamic property WP_Term::$category_description is deprecated in
/home/garrens3/public_html/blog/wp-includes/category.php on line
380
Deprecated: Creation of dynamic property WP_Term::$cat_name is deprecated in
/home/garrens3/public_html/blog/wp-includes/category.php on line
381
Deprecated: Creation of dynamic property WP_Term::$category_nicename is deprecated in
/home/garrens3/public_html/blog/wp-includes/category.php on line
382
Deprecated: Creation of dynamic property WP_Term::$category_parent is deprecated in
/home/garrens3/public_html/blog/wp-includes/category.php on line
383
Categories
Apache SparkTags
Data Science, IPython Notebook, Jupyter, Machine Learning, ML, Natural Language Processing, NLP, PySpark, Python, spark, Workshop