SBIR Phase I: Quantifying consumer rationale expressed in free text online discussions

Award Information

Agency: National Science Foundation

Branch: N/A

Contract: 1248768

Agency Tracking Number: 1248768

Amount: $150,000.00

Phase: Phase I

Program: SBIR

Solicitation Topic Code: EI

Solicitation Number: N/A

Timeline

Solicitation Year: 2012

Award Year: 2013

Award Start Date (Proposal Award Date): 2013-01-01

Award End Date (Contract End Date): 2013-12-31

Small Business Information

DMETRICS INC.

181 North 11th St

Brooklyn, NY 11211-1175

United States

DUNS: 800752441

HUBZone Owned: No

Woman Owned: No

Socially and Economically Disadvantaged: No

Principal Investigator

Name: Paul Nemirovsky
Phone: (617) 642-7163
Email: paul.nemirovsky@gmail.com

Business Contact

Name: Paul Nemirovsky
Phone: (617) 642-7163
Email: paul.nemirovsky@gmail.com

Research Institution

Name: Stub

Abstract

This Small Business Innovation Research (SBIR) Phase I project aims at developing a targeted semantic processing framework for the analysis of online conversations. A particular focus is given to improving the state-of-the art in the design of semi-supervised syntactic and semantic parsers capable of processing informal conversations about consumer products and services across industry verticals. This project will address some of the key shortcomings of existing natural language processing (NLP) frameworks when applied to noisy language typical of user-generated content, where linguistic phenomena is significantly divergent from the carefully edited content such as news reports, including colloquialisms, misspellings, grammatical errors, and incomplete sentences. Given the wide proliferation, amount, and richness of available user-generated content, combined with the limitations of the currently available shallow textual representations, creation of better NLP models is critical for extracting useful information from text that is largely informal, and for which no annotated data is available. Successful completion of this work will fulfill the urgent need for the development of NLP models accurate across multiple domains, introduce novel information extraction models for the extraction of consumer decisions, experiences, and rationale, while advancing our capability to model, quantify, and understand the reasons underlying product adoption and attrition. The broader impact/commercial potential of this project relates to the accuracy of automation that can be achieved when analyzing large amounts of unstructured text. Millions of Americans use the Internet to share, search and communicate information about products and services. Whether considering a new product, relaying past product experience, or analyzing product performance, consumers and companies alike are faced with an overwhelming range of informational sources, making the process of gathering information about products both slow and costly. Using in-depth text processing technology to analyze product reports found across social networks, forums, blogs, and company-driven websites will both increase the value of consumer-driven product reports and improve companies' and regulators' ability to learn from real-world product usage data to improve performance and public perception of marketed products and services. If successful, this project will lead to advancements in knowledge discovery, enable the automation of manual processes associated with large-scale consumer studies and change the dominant methodologies for conducting market research.

* Information listed above is at the time of submission. *

You are here

SBIR Phase I: Quantifying consumer rationale expressed in free text online discussions