Automatic Extraction of Function Knowledge from Text

AbstractThis paper presents a method to automatically extract function knowledge from natural language text. Our method uses syntactic rules to extract subject-verb-object triplets from parsed text. We then leverage the Functional Basis taxonomy, WordNet, and word2vec to classify the triplets as artifact-function-energy flow knowledge. For evaluation, we compare the function definitions associated with 30 most frequent artifacts compiled in a human-constructed knowledge base, Oregon State University’s Design Repository (DR), to those extracted using our method from 4953 Wikipedia pages classified under the category “Machines”. Our method found function definitions for 66% of the test artifacts. For those artifacts found, our method identified 50% of the function definitions compiled in DR. In addition, 75% of the most frequent function definitions found by our method were also defined in DR. The results demonstrate the promising potential of our method in automatic extraction of function knowledge.

Download publication

Get in touch

Something pique your interest? Get in touch if you’d like to learn more about Autodesk Research, our projects, people, and potential collaboration opportunities.

Contact us