July 15, 2010
In the Enterprise, extracting key entities such as names, locations, dates, and industry specific terminology from text can empower organizations to not only improve keyword search but also open the door to enhanced reasoning capabilities previously unavailable.
Using robust entity extraction technology allows for the automatic comprehension of words, sentences, paragraphs and whole documents. Categorization, extraction, domain establishment, taxonomy and ontology creation can then be built upon a Semantic Technology infra-structure to exploit the capabilities provided in OWL and RDF. This technology brings value to the Enterprise by allowing users to find, discover and create structured knowledge connections from what were previously unstructured information sets.
Expert System S.p.A. and Franz Inc. have paired their respective technologies to show how an advanced entity extractor and scalable RDF database can bring value to your project by unlocking previously unavailable knowledge from unstructured data.
The presentation will demonstrate how to represent output from the Cogito entity extractor in an AllegroGraph RDF database, and perform reasoning along with SPARQL queries. We will show an example of collecting 1000 articles scraping Google’s news page, and process them through Cogito to create RDF triples for use in AllegroGraph. The presentation will discuss tips and techniques in dealing with these representations, demonstrate how to relate entities to Linked Data such as DBpedia (or other external sources), and finally we will perform a number of queries on the resulting triple store data using some straight forward inferencing.
View a recording of the event here.
Download the presentation slides here.
About the presenters
Brooke Aker is the CEO of Expert System USA, a leading semantic technology firm, and is a long standing speaker and writer in the areas of Data & Knowledge Management and Predictive Analytics. Mr. Aker is a serial entrepreneur having formed both Acuity Software and Cipher Systems. Mr. Aker has worked with over 130 of the Global 2000 in the formation and operation of successful data management projects at such companies as Procter & Gamble, Johnson & Johnson and Petrobras. He has conducted numerous workshops and given speeches at Knowledge Management and IT conferences. His technology development of intelligence/data analytics systems has been recognized by and won awards from IBM, Strategy Magazine, and Fuld & Co. Mr. Aker earned a M.A. in Economics from Boston University and a B.A. in Economics from the University of Vermont.
Dr. Jans Aasman, CEO of Franz Inc. started his career as an experimental and cognitive psychologist, earning his PhD in cognitive science with a detailed model of car driver behavior using Lisp and Soar. He has spent most of his professional life in telecommunications research, specializing in intelligent user interfaces and applied artificial intelligence projects. From 1995 to 2004, he was also a part-time professor in the Industrial Design department of the Technical University of Delft. Jans is currently the CEO of Franz Inc., the leading supplier of commercial, persistent, and scalable RDF database products that provide the storage layer for powerful reasoning and ontology modeling capabilities for Semantic Web applications. Dr. Aasman has gained notoriety as a conference speaker at such events as Semantic Technologies Conference, International Semantic Web Conference, Java One, Linked Data Planet, INSA, GeoWeb, ICSC, RuleML and DEBS.