HW 5

sri169025

  

1. What are the main challenges of text analysis?


2. What is a corpus?


3. What are common words (such as a, and, of) called?


4. Why can't we use TF alone to measure the usefulness of the words?


5. What is a caveat of IDF? How does TFIDF address the problem?


6. Name three benefits of using TFIDF.


7. What methods can be used for sentiment analysis?


8. Research and document additional use cases and actual implementations for Hadoop.


9. Compare and contrast Hadoop, Pig, Hive, and HBase. List strengths and weaknesses of each tool set.


10. Research and summarize three published use cases for Hadoop, Pig, Hive, and HBase.
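
For reference on questions 4 to 6, here is a minimal sketch of how TF, IDF, and TFIDF are commonly computed. It assumes one popular weighting (relative term frequency multiplied by log(N / document frequency)); textbooks and libraries use several variants, so this may not match the course's exact definition. The function name `tf_idf` and the three-document toy corpus are made up for illustration.

```python
import math
from collections import Counter

def tf_idf(docs):
    """Return one {term: score} dict per document; docs are lists of tokens.

    Assumed weighting: tf = count / document length, idf = log(N / df).
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    scores = []
    for doc in docs:
        counts = Counter(doc)
        scores.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return scores

# Hypothetical toy corpus, tokenized by whitespace.
corpus = [
    "the cat sat on the mat".split(),
    "the dog chased the cat".split(),
    "the bird sang".split(),
]
scores = tf_idf(corpus)
```

Under this weighting, "the" has the highest raw TF in every document but scores zero because it appears in all three, while a word confined to a single document ("mat", "bird") scores above one shared by two documents ("cat").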

  • 7 years ago