Conference paper · Year: 2020

SLICE: Supersense-based Lightweight Interpretable Contextual Embeddings

Abstract

Contextualised embeddings such as BERT have become the de facto state of the art in many NLP applications, thanks to their impressive performance. However, their opaqueness makes their behaviour hard to interpret. SLICE is a hybrid model that combines supersense labels with contextual embeddings. We introduce a weakly supervised method to learn interpretable embeddings from raw corpora and small lists of seed words. Our model represents both a word and its context as embeddings in the same compact space, whose dimensions correspond to interpretable supersenses. We evaluate the model on a supersense tagging task for French nouns. The small amount of supervision required makes it particularly well suited for low-resource scenarios. Thanks to its interpretability, we perform linguistic analyses of the predicted supersenses in terms of input word and context representations.
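To make the idea concrete: the abstract says a word and its context are embedded in one compact space whose dimensions are supersenses, so tagging reduces to reading off the strongest dimension. The sketch below illustrates only that reading-off step; it is not the authors' implementation. The supersense inventory, the toy vectors, and the elementwise word-context combination are all assumptions made for the example.

# Minimal sketch of the idea in the abstract: words and contexts share one
# interpretable space whose dimensions are supersenses, so tagging a noun
# means finding the strongest dimension. Vocabulary, vectors, and the
# elementwise combination are illustrative assumptions, not the paper's method.
import numpy as np

SUPERSENSES = ["person", "animal", "food", "artifact", "location"]

# Toy interpretable embeddings: one weight per supersense dimension.
WORD_VECS = {
    "avocat": np.array([0.8, 0.0, 0.7, 0.0, 0.0]),  # lawyer vs. avocado
    "mange":  np.array([0.1, 0.2, 0.9, 0.0, 0.0]),  # "eats"
    "plaide": np.array([0.9, 0.0, 0.0, 0.0, 0.1]),  # "pleads (in court)"
}

def embed_context(tokens):
    """Represent the context as the average of its words' vectors."""
    vecs = [WORD_VECS[t] for t in tokens if t in WORD_VECS]
    return np.mean(vecs, axis=0) if vecs else np.zeros(len(SUPERSENSES))

def tag_supersense(word, context_tokens):
    """Combine word and context in the shared space; argmax = supersense."""
    combined = WORD_VECS[word] * embed_context(context_tokens)  # elementwise
    return SUPERSENSES[int(np.argmax(combined))]

print(tag_supersense("avocat", ["mange"]))   # -> "food"   (avocado reading)
print(tag_supersense("avocat", ["plaide"]))  # -> "person" (lawyer reading)

Because every dimension is a named supersense, the combined vector itself is the linguistic analysis: one can inspect which dimensions the word and the context each contributed, which is the kind of interpretability the abstract claims.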
Main file: main.pdf (238.94 KB)
Origin: files produced by the author(s)

Dates and versions

hal-03017741, version 1 (21-11-2020)

Identifiers

  • HAL Id: hal-03017741, version 1

Cite

Cindy Aloui, Carlos Ramisch, Alexis Nasr, Lucie Barque. SLICE: Supersense-based Lightweight Interpretable Contextual Embeddings. The 28th International Conference on Computational Linguistics (COLING 2020), Dec 2020, Barcelona (online), Spain. ⟨hal-03017741⟩