Solr is an open source, widely used, popular IR machine. It can be used for simple sentiment analysis and sentiment retrieval tool. Its multi-language analyzers together with UIMA (Unstructured Information Management Architecture) framework can be extended for sentiment extraction. Each sentence passes through a series of pluggable annotators. Entity and its associated polarity are detected for each sentence. Polarity of each sentence is stored into Solr index. Persistent model files can be created from training data and accessed at run time.
4. What is Sentiment Analysis?
A linguistic analysis technique that identifies
The movie is great.
The movie stars Mr. X
The movie is horrible.
opinion early in a piece of text.
9. Linguistics module
Stems, Lemmas and Synonyms
multi language capability
CJKAnalyzer, UIMA Analyzers
UIMA integration
UpdateProcessorChain
Why Solr ?
10. Why Solr ?
Extract domain specific entities
and concepts
Time and Cost
Solr Set Up â 5 mins
UIMA Annotators - 5 days
Enrich text, write to dedicated field
11. Tagging entities in review text
Usecase
I wasn't really in the market for another tablet, but my girlfriend ended
up getting one for me so she got me on this one. I would like to say that
this tablet reminds me of the first Motorola Droid smartphone that came
out several years back. The phone jam packed a ton of bells & whistles
into its hardware and software to give a lot of bang for your buck. This
is what it feels like amazon has done with the Kindle Fire 8.9. They have
put a lot of advanced hardware and innovative software, so for the
average user, specially someone who absorbs a lot of media, you get a
lot for the price. But just because you get a lot for the price, doesn't
mean it is without its flaws.
13. Digital SLR with Full 1080p HD Video
There are many preprogrammed scene modes
that make this a very easy camera to use.
The picture quality is beyond belief, and
even better for the price.
Price:
Usecase
14. Why UIMA ?
UIMA Framework manages components
and data flow â No coding
Deploy pipeline of analysis engines
AEs wrap NLP algorithms
Person
Place
organization
Language
Detection
Aggregate analysis engine
Sentence
Annotator
POS
Annotator
NER
16. NLP+UIMA
Use POS in query understanding
boosting terms
Synonym expansion
Extract concepts/entities
Faceting using entities
Identify places in query
and use spatial queries
17. Ideas: Sentiment Analysis App
Identify Subjective Sentences from text
Remove noisy sentences
â Regex, conditional probability
Graph min cut â LingPipe
Subjectivity Lexicons
Discard Facts and Objective Sentences