Slide 1

Slide 1 text

Team – 22 M Manoj Kumar – Srinath Ravichandran - Dharmesh kakadia – Sandhya S (201107502) - (201107625) - (201107616) - (201107617) REVIEW SUMMARY SYSTEM

Slide 2

Slide 2 text

OVERVIEW •  System to summarize reviews from various sources •  Users can view and compare products based on features •  Results exposed as RESTful web-service •  Ability to cater to different products

Slide 3

Slide 3 text

OVERALL WORK FLOW Feature Extraction Sentiment Analysis Sentiment Classification

Slide 4

Slide 4 text

DETAILED FLOW CHART Reviews Parse and Tag Feature Extraction Feature DB Opinion DB •  Once for a category of product •  Nouns #frequency •  Adjectives #frequency •  Classifier is designed based on this data.

Slide 5

Slide 5 text

Review •  Raw Review Sentence Pruning •  Preprocess data •  List of valid features Dependency relations •  Using Stanford Parser Semantic Analyzer NoSQL (mongo) Feature DB •  Each sentence is passed through NLP logic. •  Features are extracted and rated according to the opinion of the setence.

Slide 6

Slide 6 text

PARALLELIZING WITH HADOOP Mapper Reducer Reducer Reducer mobile2 Summary Data Base mobille1 mobille2 mobille3 (Tag the Review)

Slide 7

Slide 7 text

DATABASE SCHEMA Trained Data •  Nouns # •  Modifiers # Tagged Reviews •  Features •  Ratings •  Review Text Review Summary •  Features •  Average Rating Product X

Slide 8

Slide 8 text

RESTFUL WEB SERVICES •  System exposes results as restful web services. Review System

Slide 9

Slide 9 text

EXPERIENCES & LEARNING •  NLP Dependency Relationships!!! •  REST is BEST •  SCHEMA defines EVERYTHING!!

Slide 10

Slide 10 text

FUTURE WORK •  Better feature Extraction. •  Synonym match can be extended with Wordnet::Similarity. •  Can be further optimized for blazing performance. •  Preprocess user query.

Slide 11

Slide 11 text

TOOLS USED •  NLP •  Stanford Parser •  Wordnet (Synonyms) •  Sentiwordnet •  Hadoop 20.2 •  Mongo DB