A Lexicon based Approach Towards Concept Extraction

  • Anoud Shaikh
  • Naeem Ahmed Mahoto
  • Mukhtiar Ali Unar


The emergence of digital media has tremendously increased the amount of unstructured data. Recently, 80% of the data generated over the web has been in unstructured format. This immense amount of data is a rich source for knowledge discovery and may thus be utilized for extracting purposeful information. This study adopts a lexicon-based approach for automatic concept extraction from online news stories and events. An application prototype has been developed to demonstrate the applicability and effectiveness of the adopted approach. The extracted knowledge about news stories, articles and blogs is essential for news analysts who need an in-depth understanding of the information. This knowledge plays a vital role in building societies, since the media is considered an opinion maker for its audience.
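
As a rough illustration of what lexicon-based concept extraction can look like, the following minimal sketch matches the words of a news story against a small lexicon and counts the concepts they point to. The abstract does not describe the study's lexicon or matching rules, so everything here is an assumption made for illustration: the toy `LEXICON`, its concept labels, the `extract_concepts` helper, and the sample story are all hypothetical and are not the prototype's implementation.

```python
import re
from collections import Counter

# Hypothetical toy lexicon (an assumption, not the study's lexicon):
# maps a surface term to the concept it signals.
LEXICON = {
    "election": "politics",
    "parliament": "politics",
    "minister": "politics",
    "cricket": "sports",
    "match": "sports",
    "inflation": "economy",
    "budget": "economy",
}


def extract_concepts(text, lexicon=LEXICON):
    """Count how often each lexicon concept is evoked by the text."""
    # Lowercase the text and keep alphabetic tokens only.
    tokens = re.findall(r"[a-z]+", text.lower())
    # Look each token up in the lexicon and tally the matched concepts.
    return Counter(lexicon[t] for t in tokens if t in lexicon)


# Hypothetical sample sentence standing in for an online news story.
story = "The minister told parliament that the budget will curb inflation."
print(extract_concepts(story).most_common())
```

A practical system would likely need a far larger lexicon, along with stemming or lemmatization and handling of multi-word terms; the sketch only shows the core lookup-and-count idea.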