Extracción automática de colocaciones terminológicas en un corpus extenso de lengua general

  1. Octavio Santana Suárez
  2. José Rafael Pérez Aguiar
  3. Isabel Sánchez Berriel
  4. Virginia Gutiérrez Rodríguez
Revue:
Procesamiento del lenguaje natural

ISSN: 1135-5948

Année de publication: 2011

Número: 47

Pages: 145-152

Type: Article

Exporter: RIS

Indicateurs

Cité par

  • Dialnet Métricas Cité par: 1 (17-09-2021)

CIRC

  • Ciencias Sociales: B
  • Ciencias Humanas: A

CiteScore

  • Año 2011
  • CiteScore de la revista: 0.0
  • Área: Language and Linguistics Percentil: 14
  • Área: Linguistics and Language Percentil: 13
  • Área: Computer Science Applications Percentil: 1

Résumé

The automatic systems which deal with term’s extractions constitute an important tool when they make reference to the labor of compilation of lexemes, which is restricted to a specific field or specialty. The textual analysis that are realized for this type of software must include strategies that could detect collocations in the field in which is done. In this topic is studied the viability of the use from extensive textual’s corpus, that have not contain linguistic information, as happen with those textual’s corpus that could be compiled from internet. The internet is used like a source of information for the recompilation of terminology’s collocations. With that purpose is analyzed the behavior of different indicators based on the frequencies registered for a collection of economic terms in a Spanish corpus of 300.000 words.