Galilée Institute / University Sorbonne Paris North
RCLN@LIPN
France
Developing lexico-semantic resources is a major issue in the Natural Language Processing field. These resources, by making explicit inter alia some knowledge possessed only by humans, aim at providing the ability of a precise and complete text understanding to NLP tasks. Popular resources-building strategies involving crowdsourcing are flowering in NLP and are proved to be successful. However, the resulted resources are not free of errors and lack some important semantic relations. In this PhD thesis, we used the french lexico-semantic network from the project JeuxDeMots as a case-study. We designed an endogenous consolidation system for this type of networks based on inferring and annotating new semantic relations using the already existing ones, as well as extracting and proposing inference rules able to (re)generate a considerable part of the network. In addition, we conceived a domain specific language for manipulating the consolidation system along with the network itself and a prototype was implemented.
TEXTE team of LIRMM (Montpellier Laboratory of Informatics, Robotics and Microelectronics) in Montpellier University & French National Center for Scientic Research (CNRS) (France)