Genre corpus and tree


The GENTT management tool makes it possible to feed and exploit a Corpus of texts on a cooperative and personalised basis. As it is an open, flexible and cooperative system, GENTT Corpus users can design and create their work to suit their individual needs and styles. The Corpus provides access to model texts that reflect the reality of specialised language, with essential and up-to-date information on the language in context.

The system recognises different user types: administrators, with access to all the functions of the Corpus; basic users, who can search for documents, create subcorpora, feed the Corpus and edit data they have created; and privileged users, who can also invite other users to participate in the tool. To make use of the tool’s functions according to their assigned role, users must register by invitation from an existing user or an administrator’s decision.

Once users are inside the system, they can perform a range of actions, depending on their role, some of which are listed below:

  • Consult data of various kinds: linguistic/terminological, conceptual, communicative situation-related, etc.
  • Upload documents
  • Download documents
  • Manage and edit documents
  • Work with subcorpora
  • Manage trees and taxonomies
  • Consult reports
  • Issue and manage invitations

The future poses several challenges for the GENTT Group; among other possibilities, there are plans for the Corpus to include connection channels to external linguistic analysis and terminology extraction tools, as well as a semi-automated text classification system.

The Corpus also includes a detailed classification of legal, medical and technical genre trees, as illustrated in the following images: