Analyse linguistique et extraction de collocations
|
Auteur(s)Dr Luka Nerima , Laboratoire d'analyse et de technologie du langage (LATL) , Département de linguistique , Faculté des lettres , Université de Genève (Unige) . Mme Violeta Seretan , Laboratoire d'analyse et de technologie du langage (LATL) , Département de linguistique , Faculté des lettres , Université de Genève (Unige) . |
Projet de recherche
Analyse linguistique et extraction de collocations
> voir le descriptif du projet
Mots-clés
Informatique - WebRésumé
Ce résumé n'existe qu'en anglais
This paper presents a method for extracting multi-word collocations (MWCs) from text corpora, which is based on the previous extraction of syntactically bound collocation bigrams. We describe an iterative word linking procedure which relies on a syntactic criterion and aims at building up arbitrarily long expressions that represent multi-word collocation candidates. We propose several measures to rank candidates according to the collocational strength, and we present the results of a trigram extraction experiment. The methodology used is particularly well-suited for the identification of those collocations whose terms are arbitrarily distant, due to syntactic processes (passivization, relativization, dislocation, topicalization).
Fichier(s)
Collocation Translation Based on Sentence Alignment and Parsing
Anglais | [127 ko] > télécharger |
Multi-Word Collocation Extraction by Syntactic Composition of Collocation Bigrams
Anglais | [1230 ko] > télécharger |
Multilingual Collocation Extraction: Issues and Solutions
Anglais | [132 ko] > télécharger |
Induction of Syntactic Collocation Patterns from Generic Syntactic Relations
Anglais | [14 ko] > télécharger |
Accurate Collocation Extraction Using a Multilingual Parser
Anglais | [126 ko] > télécharger |
Le problème des collocations en TAL
Français | [388 ko] > télécharger |
Syntactic-based Collocation Extraction from Parallel Corpora and from the Web (PowerPoint Presentation)
Anglais | [ ko] > télécharger |
A Tool for Multi-Word Collocation Extraction and Visualization in Multilingual Corpora
Anglais | [ ko] > télécharger |
Using the Web as a Corpus for the Syntactic-Based Collocation Identification
Anglais | [ ko] > télécharger |
Extraction of Multi-Word Collocations Using Syntactic Bigram Composition
Anglais | [159 ko] > télécharger |
Creating a Multilingual Collocation Dictionary from Large Text Corpora
Anglais | [60 ko] > télécharger |