Extracción automática de colocaciones terminológicas en un corpus extenso de lengua general

  1. Octavio Santana Suárez
  2. José Rafael Pérez Aguiar
  3. Isabel Sánchez Berriel
  4. Virginia Gutiérrez Rodríguez
Procesamiento del lenguaje natural

ISSN: 1135-5948

Ano de publicación: 2011

Número: 47

Páxinas: 145-152

Tipo: Artigo

Exportar: RIS


Citas recibidas

  • Citas en Dialnet Métricas: 1 (17-09-2021)

SCImago Journal Rank

  • Ano 2011
  • Impacto SJR da revista:
  • Cuartil maior:
  • Área: Computer Science Applications Cuartil: - Posición na área: 1784/1812
  • Área: Language and Linguistics Cuartil: - Posición na área: 615/654
  • Área: Linguistics and Language Cuartil: - Posición na área: 619/659


  • Ciencias Sociais: B
  • Ciencias Humanas: A


  • Ano 2011
  • CiteScore da revista: 0.0
  • Área: Language and Linguistics Percentil: 14
  • Área: Linguistics and Language Percentil: 13
  • Área: Computer Science Applications Percentil: 1


The automatic systems which deal with term’s extractions constitute an important tool when they make reference to the labor of compilation of lexemes, which is restricted to a specific field or specialty. The textual analysis that are realized for this type of software must include strategies that could detect collocations in the field in which is done. In this topic is studied the viability of the use from extensive textual’s corpus, that have not contain linguistic information, as happen with those textual’s corpus that could be compiled from internet. The internet is used like a source of information for the recompilation of terminology’s collocations. With that purpose is analyzed the behavior of different indicators based on the frequencies registered for a collection of economic terms in a Spanish corpus of 300.000 words.