Entraîner un modèle…

Il y a plusieurs approches. Nous allons en présenter deux: w2v et glove.

Word2Vector

Documentation gensim

  • Continous bag of words (CBOW): utiliser le contexte comme input pour prévoir le mot cible.

  • Skip-gram: utiliser un mot comme input pour prévoir le contexte. Article original

Skip-gram

  1. One hot encoding: un vecteur du même nombre des dimension que la taille du vocabulaire.

\[\begin{split} \begin{bmatrix} 0\\ 0\\ 0\\ 1\\ 0\\ \vdots \\ 0\\ \end{bmatrix} \end{split}\]
  1. une couche cachée

  2. un output (une série d’autres one hot encodings)

Le but est d’ajuster les poids pour ensuite pouvoir représenter le mot avec la cocuhe cachée. Donc la tâche qu,on donne au réseau ne sert pas à entraîner un modèle capable ensuite de réaliser cette tâche (deviner les mots du contexte), mais cela servira simplement à représenter le mot.

Concrètement une ligne de la couche cachée sera notre vecteur:

http://mccormickml.com/assets/word2vec/matrix_mult_w_one_hot.png

L’idée est que si deux mots se trouvent statistiquemnt dans des contextes très similaires ils doivent être proches.

from gensim.models import Word2Vec
with open('fichiers/recherche.txt', 'r') as file:
    recherche = file.read()
recherche = recherche.replace('\xa0', '')
recherche = recherche.replace('\n', ' ')
sentences = [s for s in recherche.split('.')]
   
sentences = [s.split(' ') for s in sentences]
model = Word2Vec(sentences=sentences)
model.wv.vocab
{'': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2550>,
 'À': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2588>,
 'la': <gensim.models.keyedvectors.Vocab at 0x7f170ceb25c0>,
 'recherche': <gensim.models.keyedvectors.Vocab at 0x7f170ceb25f8>,
 'du': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2630>,
 'temps': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2668>,
 'perdu': <gensim.models.keyedvectors.Vocab at 0x7f170ceb26a0>,
 'Proust': <gensim.models.keyedvectors.Vocab at 0x7f170ceb26d8>,
 'de': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2710>,
 'le': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2748>,
 'MARCEL': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2780>,
 'PROUST': <gensim.models.keyedvectors.Vocab at 0x7f170ceb27b8>,
 'LA': <gensim.models.keyedvectors.Vocab at 0x7f170ceb27f0>,
 'RECHERCHE': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2828>,
 'DU': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2860>,
 'TEMPS': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2898>,
 'PERDU': <gensim.models.keyedvectors.Vocab at 0x7f170ceb28d0>,
 'nrf': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2908>,
 'GALLIMARD': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2940>,
 '-': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2978>,
 '1': <gensim.models.keyedvectors.Vocab at 0x7f170ceb29b0>,
 'Du': <gensim.models.keyedvectors.Vocab at 0x7f170ceb29e8>,
 'côté': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2a20>,
 'chez': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2a58>,
 'Swann': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2a90>,
 '2': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2ac8>,
 'l’ombre': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2b00>,
 'des': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2b38>,
 'jeunes': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2b70>,
 'filles': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2ba8>,
 'en': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2be0>,
 'fleurs': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2c18>,
 'Le': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2c50>,
 'Guermantes': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2c88>,
 'Sodome': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2cc0>,
 'et': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2cf8>,
 'Gomorrhe': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2d30>,
 'La': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2d68>,
 '6': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2da0>,
 'Albertine': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2dd8>,
 'Temps': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2e10>,
 'retrouvé': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2e48>,
 'DE': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2e80>,
 'Comme': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2eb8>,
 'un': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2ef0>,
 'témoignage': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2f28>,
 'profonde': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2f60>,
 'reconnaissance': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2f98>,
 'Première': <gensim.models.keyedvectors.Vocab at 0x7f170ceb2fd0>,
 'Combray': <gensim.models.keyedvectors.Vocab at 0x7f170cb16048>,
 'Un': <gensim.models.keyedvectors.Vocab at 0x7f170cb16080>,
 'amour': <gensim.models.keyedvectors.Vocab at 0x7f170cb160b8>,
 'nom': <gensim.models.keyedvectors.Vocab at 0x7f170cb160f0>,
 'PARTIE': <gensim.models.keyedvectors.Vocab at 0x7f170cb16128>,
 'je': <gensim.models.keyedvectors.Vocab at 0x7f170cb16160>,
 'me': <gensim.models.keyedvectors.Vocab at 0x7f170cb16198>,
 'suis': <gensim.models.keyedvectors.Vocab at 0x7f170cb161d0>,
 'couché': <gensim.models.keyedvectors.Vocab at 0x7f170cb16208>,
 'bonne': <gensim.models.keyedvectors.Vocab at 0x7f170cb16240>,
 'heure': <gensim.models.keyedvectors.Vocab at 0x7f170cb16278>,
 'Parfois,': <gensim.models.keyedvectors.Vocab at 0x7f170cb162b0>,
 'à': <gensim.models.keyedvectors.Vocab at 0x7f170cb162e8>,
 'peine': <gensim.models.keyedvectors.Vocab at 0x7f170cb16320>,
 'ma': <gensim.models.keyedvectors.Vocab at 0x7f170cb16358>,
 'bougie': <gensim.models.keyedvectors.Vocab at 0x7f170cb16390>,
 'mes': <gensim.models.keyedvectors.Vocab at 0x7f170cb163c8>,
 'yeux': <gensim.models.keyedvectors.Vocab at 0x7f170cb16400>,
 'se': <gensim.models.keyedvectors.Vocab at 0x7f170cb16438>,
 'fermaient': <gensim.models.keyedvectors.Vocab at 0x7f170cb16470>,
 'si': <gensim.models.keyedvectors.Vocab at 0x7f170cb164a8>,
 'vite': <gensim.models.keyedvectors.Vocab at 0x7f170cb164e0>,
 'que': <gensim.models.keyedvectors.Vocab at 0x7f170cb16518>,
 'n’avais': <gensim.models.keyedvectors.Vocab at 0x7f170cb16550>,
 'pas': <gensim.models.keyedvectors.Vocab at 0x7f170cb16588>,
 'dire:': <gensim.models.keyedvectors.Vocab at 0x7f170cb165c0>,
 '«Je': <gensim.models.keyedvectors.Vocab at 0x7f170cb165f8>,
 '»': <gensim.models.keyedvectors.Vocab at 0x7f170cb16668>,
 'Et,': <gensim.models.keyedvectors.Vocab at 0x7f170cb166a0>,
 'une': <gensim.models.keyedvectors.Vocab at 0x7f170cb166d8>,
 'demi-heure': <gensim.models.keyedvectors.Vocab at 0x7f170cb16710>,
 'après,': <gensim.models.keyedvectors.Vocab at 0x7f170cb16748>,
 'pensée': <gensim.models.keyedvectors.Vocab at 0x7f170cb16780>,
 'qu’il': <gensim.models.keyedvectors.Vocab at 0x7f170cb167b8>,
 'était': <gensim.models.keyedvectors.Vocab at 0x7f170cb167f0>,
 'chercher': <gensim.models.keyedvectors.Vocab at 0x7f170cb16828>,
 'sommeil': <gensim.models.keyedvectors.Vocab at 0x7f170cb16860>,
 'voulais': <gensim.models.keyedvectors.Vocab at 0x7f170cb16898>,
 'poser': <gensim.models.keyedvectors.Vocab at 0x7f170cb168d0>,
 'volume': <gensim.models.keyedvectors.Vocab at 0x7f170cb16908>,
 'croyais': <gensim.models.keyedvectors.Vocab at 0x7f170cb16940>,
 'avoir': <gensim.models.keyedvectors.Vocab at 0x7f170cb16978>,
 'encore': <gensim.models.keyedvectors.Vocab at 0x7f170cb169b0>,
 'dans': <gensim.models.keyedvectors.Vocab at 0x7f170cb169e8>,
 'les': <gensim.models.keyedvectors.Vocab at 0x7f170cb16a20>,
 'mains': <gensim.models.keyedvectors.Vocab at 0x7f170cb16a58>,
 'cessé': <gensim.models.keyedvectors.Vocab at 0x7f170cb16a90>,
 'dormant': <gensim.models.keyedvectors.Vocab at 0x7f170cb16ac8>,
 'faire': <gensim.models.keyedvectors.Vocab at 0x7f170cb16b00>,
 'réflexions': <gensim.models.keyedvectors.Vocab at 0x7f170cb16b38>,
 'sur': <gensim.models.keyedvectors.Vocab at 0x7f170cb16b70>,
 'ce': <gensim.models.keyedvectors.Vocab at 0x7f170cb16ba8>,
 'venais': <gensim.models.keyedvectors.Vocab at 0x7f170cb16be0>,
 'lire,': <gensim.models.keyedvectors.Vocab at 0x7f170cb16c18>,
 'mais': <gensim.models.keyedvectors.Vocab at 0x7f170cb16c50>,
 'ces': <gensim.models.keyedvectors.Vocab at 0x7f170cb16c88>,
 'avaient': <gensim.models.keyedvectors.Vocab at 0x7f170cb16cc0>,
 'pris': <gensim.models.keyedvectors.Vocab at 0x7f170cb16cf8>,
 'tour': <gensim.models.keyedvectors.Vocab at 0x7f170cb16d30>,
 'peu': <gensim.models.keyedvectors.Vocab at 0x7f170cb16d68>,
 'particulier;': <gensim.models.keyedvectors.Vocab at 0x7f170cb16da0>,
 'il': <gensim.models.keyedvectors.Vocab at 0x7f170cb16dd8>,
 'semblait': <gensim.models.keyedvectors.Vocab at 0x7f170cb16e10>,
 'j’étais': <gensim.models.keyedvectors.Vocab at 0x7f170cb16e48>,
 'moi-même': <gensim.models.keyedvectors.Vocab at 0x7f170cb16e80>,
 'dont': <gensim.models.keyedvectors.Vocab at 0x7f170cb16eb8>,
 'parlait': <gensim.models.keyedvectors.Vocab at 0x7f170cb16ef0>,
 'église,': <gensim.models.keyedvectors.Vocab at 0x7f170cb16f28>,
 'François': <gensim.models.keyedvectors.Vocab at 0x7f170cb16f60>,
 'I^(er)': <gensim.models.keyedvectors.Vocab at 0x7f170cb16f98>,
 'Cette': <gensim.models.keyedvectors.Vocab at 0x7f170cb16fd0>,
 'croyance': <gensim.models.keyedvectors.Vocab at 0x7f170c9fd048>,
 'survivait': <gensim.models.keyedvectors.Vocab at 0x7f170c9fd080>,
 'pendant': <gensim.models.keyedvectors.Vocab at 0x7f170c9fd0b8>,
 'quelques': <gensim.models.keyedvectors.Vocab at 0x7f170c9fd0f0>,
 'secondes': <gensim.models.keyedvectors.Vocab at 0x7f170c9fd128>,
 'mon': <gensim.models.keyedvectors.Vocab at 0x7f170c9fd160>,
 'elle': <gensim.models.keyedvectors.Vocab at 0x7f170c9fd198>,
 'ne': <gensim.models.keyedvectors.Vocab at 0x7f170c9fd1d0>,
 'raison,': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1550>,
 'pesait': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1588>,
 'comme': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1518>,
 'empêchait': <gensim.models.keyedvectors.Vocab at 0x7f170cfc14a8>,
 'rendre': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1710>,
 'compte': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1748>,
 'n’était': <gensim.models.keyedvectors.Vocab at 0x7f170cfc17b8>,
 'allumé': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1470>,
 'Puis': <gensim.models.keyedvectors.Vocab at 0x7f170cfc16a0>,
 'commençait': <gensim.models.keyedvectors.Vocab at 0x7f170cfc15c0>,
 'devenir': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1630>,
 'après': <gensim.models.keyedvectors.Vocab at 0x7f170cfc11d0>,
 'pensées': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1908>,
 'd’une': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1828>,
 'existence': <gensim.models.keyedvectors.Vocab at 0x7f170cfc16d8>,
 'sujet': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1780>,
 'livre': <gensim.models.keyedvectors.Vocab at 0x7f170cfc18d0>,
 'détachait': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1978>,
 'moi,': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1940>,
 'libre': <gensim.models.keyedvectors.Vocab at 0x7f170cfc19b0>,
 'm’y': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1a20>,
 'appliquer': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1a58>,
 'ou': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1a90>,
 'non;': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1ac8>,
 'aussitôt': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1b00>,
 'vue': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1b38>,
 'bien': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1b70>,
 'étonné': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1ba8>,
 'trouver': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1be0>,
 'autour': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1c18>,
 'moi': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1c50>,
 'douce': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1c88>,
 'pour': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1cc0>,
 'yeux,': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1cf8>,
 'peut-être': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1d30>,
 'plus': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1d68>,
 'esprit,': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1da0>,
 'qui': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1dd8>,
 'apparaissait': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1e10>,
 'chose': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1e48>,
 'sans': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1e80>,
 'cause,': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1eb8>,
 'incompréhensible,': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1ef0>,
 'vraiment': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1f28>,
 'obscure': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1f60>,
 'Je': <gensim.models.keyedvectors.Vocab at 0x7f1738c77978>,
 'demandais': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1400>,
 'quelle': <gensim.models.keyedvectors.Vocab at 0x7f170cfc12e8>,
 'pouvait': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1358>,
 'j’entendais': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1240>,
 'trains': <gensim.models.keyedvectors.Vocab at 0x7f170cfc12b0>,
 'qui,': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1898>,
 'moins': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1f98>,
 'éloigné,': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1fd0>,
 'chant': <gensim.models.keyedvectors.Vocab at 0x7f170cfc1668>,
 'd’un': <gensim.models.keyedvectors.Vocab at 0x7f17081df9b0>,
 'oiseau': <gensim.models.keyedvectors.Vocab at 0x7f17081df9e8>,
 'forêt,': <gensim.models.keyedvectors.Vocab at 0x7f17081dfa20>,
 'relevant': <gensim.models.keyedvectors.Vocab at 0x7f17081dfa58>,
 'distances,': <gensim.models.keyedvectors.Vocab at 0x7f17081dfa90>,
 'décrivait': <gensim.models.keyedvectors.Vocab at 0x7f17081dfac8>,
 'l’étendue': <gensim.models.keyedvectors.Vocab at 0x7f17081dfb00>,
 'campagne': <gensim.models.keyedvectors.Vocab at 0x7f17081dfb38>,
 'où': <gensim.models.keyedvectors.Vocab at 0x7f17081dfb70>,
 'voyageur': <gensim.models.keyedvectors.Vocab at 0x7f17081dfba8>,
 'hâte': <gensim.models.keyedvectors.Vocab at 0x7f17081dfbe0>,
 'vers': <gensim.models.keyedvectors.Vocab at 0x7f17081dfc18>,
 'station': <gensim.models.keyedvectors.Vocab at 0x7f17081dfc50>,
 'petit': <gensim.models.keyedvectors.Vocab at 0x7f17081dfc88>,
 'chemin': <gensim.models.keyedvectors.Vocab at 0x7f17081dfcc0>,
 'suit': <gensim.models.keyedvectors.Vocab at 0x7f17081dfcf8>,
 'va': <gensim.models.keyedvectors.Vocab at 0x7f17081dfd30>,
 'être': <gensim.models.keyedvectors.Vocab at 0x7f17081dfd68>,
 'son': <gensim.models.keyedvectors.Vocab at 0x7f17081dfda0>,
 'souvenir': <gensim.models.keyedvectors.Vocab at 0x7f17081dfdd8>,
 'par': <gensim.models.keyedvectors.Vocab at 0x7f17081dfe10>,
 'doit': <gensim.models.keyedvectors.Vocab at 0x7f17081dfe48>,
 'lieux': <gensim.models.keyedvectors.Vocab at 0x7f17081dfe80>,
 'nouveaux,': <gensim.models.keyedvectors.Vocab at 0x7f17081dfeb8>,
 'actes': <gensim.models.keyedvectors.Vocab at 0x7f17081dfef0>,
 'causerie': <gensim.models.keyedvectors.Vocab at 0x7f17081dff28>,
 'récente': <gensim.models.keyedvectors.Vocab at 0x7f17081dff60>,
 'aux': <gensim.models.keyedvectors.Vocab at 0x7f17081dff98>,
 'adieux': <gensim.models.keyedvectors.Vocab at 0x7f17081dffd0>,
 'sous': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6048>,
 'lampe': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6080>,
 'étrangère': <gensim.models.keyedvectors.Vocab at 0x7f170c9a60b8>,
 'suivent': <gensim.models.keyedvectors.Vocab at 0x7f170c9a60f0>,
 'silence': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6128>,
 'nuit,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6160>,
 'douceur': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6198>,
 'prochaine': <gensim.models.keyedvectors.Vocab at 0x7f170c9a61d0>,
 'retour': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6208>,
 'tendrement': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6240>,
 'joues': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6278>,
 'contre': <gensim.models.keyedvectors.Vocab at 0x7f170c9a62b0>,
 'belles': <gensim.models.keyedvectors.Vocab at 0x7f170c9a62e8>,
 'pleines': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6320>,
 'sont': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6358>,
 'notre': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6390>,
 'enfance': <gensim.models.keyedvectors.Vocab at 0x7f170c9a63c8>,
 'regarder': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6400>,
 'montre': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6438>,
 'Bientôt': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6470>,
 'minuit': <gensim.models.keyedvectors.Vocab at 0x7f170c9a64a8>,
 'C’est': <gensim.models.keyedvectors.Vocab at 0x7f170c9a64e0>,
 'l’instant': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6518>,
 'malade': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6550>,
 'a': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6588>,
 'été': <gensim.models.keyedvectors.Vocab at 0x7f170c9a65c0>,
 'obligé': <gensim.models.keyedvectors.Vocab at 0x7f170c9a65f8>,
 'partir': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6630>,
 'voyage': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6668>,
 'dû': <gensim.models.keyedvectors.Vocab at 0x7f170c9a66a0>,
 'coucher': <gensim.models.keyedvectors.Vocab at 0x7f170c9a66d8>,
 'hôtel': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6710>,
 'inconnu,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6748>,
 'réveillé': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6780>,
 'apercevant': <gensim.models.keyedvectors.Vocab at 0x7f170c9a67b8>,
 'porte': <gensim.models.keyedvectors.Vocab at 0x7f170c9a67f0>,
 'raie': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6828>,
 'jour': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6860>,
 'Quel': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6898>,
 'c’est': <gensim.models.keyedvectors.Vocab at 0x7f170c9a68d0>,
 'déjà': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6908>,
 'Dans': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6940>,
 'moment': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6978>,
 'domestiques': <gensim.models.keyedvectors.Vocab at 0x7f170c9a69b0>,
 'seront': <gensim.models.keyedvectors.Vocab at 0x7f170c9a69e8>,
 'pourra': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6a20>,
 'sonner,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6a58>,
 'on': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6a90>,
 'viendra': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6ac8>,
 'lui': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6b00>,
 'porter': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6b38>,
 'secours': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6b70>,
 'd’être': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6ba8>,
 'soulagé': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6be0>,
 'donne': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6c18>,
 'courage': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6c50>,
 'souffrir': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6c88>,
 'Justement': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6cc0>,
 'cru': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6cf8>,
 'entendre': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6d30>,
 'pas;': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6d68>,
 'puis': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6da0>,
 'Et': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6dd8>,
 'sa': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6e10>,
 'disparu': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6e48>,
 'vient': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6e80>,
 'd’éteindre': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6eb8>,
 'dernier': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6ef0>,
 'domestique': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6f28>,
 'est': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6f60>,
 'parti': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6f98>,
 'faudra': <gensim.models.keyedvectors.Vocab at 0x7f170c9a6fd0>,
 'rester': <gensim.models.keyedvectors.Vocab at 0x7f170c993048>,
 'toute': <gensim.models.keyedvectors.Vocab at 0x7f170c993080>,
 'nuit': <gensim.models.keyedvectors.Vocab at 0x7f170c9930b8>,
 'remède': <gensim.models.keyedvectors.Vocab at 0x7f170c9930f0>,
 'parfois': <gensim.models.keyedvectors.Vocab at 0x7f170c993128>,
 'courts': <gensim.models.keyedvectors.Vocab at 0x7f170c993160>,
 'instant,': <gensim.models.keyedvectors.Vocab at 0x7f170c993198>,
 'd’entendre': <gensim.models.keyedvectors.Vocab at 0x7f170c9931d0>,
 'd’ouvrir': <gensim.models.keyedvectors.Vocab at 0x7f170c993208>,
 'fixer': <gensim.models.keyedvectors.Vocab at 0x7f170c993240>,
 'kaléidoscope': <gensim.models.keyedvectors.Vocab at 0x7f170c993278>,
 'l’obscurité,': <gensim.models.keyedvectors.Vocab at 0x7f170c9932b0>,
 'goûter': <gensim.models.keyedvectors.Vocab at 0x7f170c9932e8>,
 'grâce': <gensim.models.keyedvectors.Vocab at 0x7f170c993320>,
 'lueur': <gensim.models.keyedvectors.Vocab at 0x7f170c993358>,
 'momentanée': <gensim.models.keyedvectors.Vocab at 0x7f170c993390>,
 'conscience': <gensim.models.keyedvectors.Vocab at 0x7f170c9933c8>,
 'étaient': <gensim.models.keyedvectors.Vocab at 0x7f170c993400>,
 'plongés': <gensim.models.keyedvectors.Vocab at 0x7f170c993438>,
 'meubles,': <gensim.models.keyedvectors.Vocab at 0x7f170c993470>,
 'chambre,': <gensim.models.keyedvectors.Vocab at 0x7f170c9934a8>,
 'tout': <gensim.models.keyedvectors.Vocab at 0x7f170c9934e0>,
 'n’étais': <gensim.models.keyedvectors.Vocab at 0x7f170c993518>,
 'qu’une': <gensim.models.keyedvectors.Vocab at 0x7f170c993550>,
 'petite': <gensim.models.keyedvectors.Vocab at 0x7f170c993588>,
 'partie': <gensim.models.keyedvectors.Vocab at 0x7f170c9935c0>,
 'duquel': <gensim.models.keyedvectors.Vocab at 0x7f170c9935f8>,
 'retournais': <gensim.models.keyedvectors.Vocab at 0x7f170c993630>,
 'Ou': <gensim.models.keyedvectors.Vocab at 0x7f170c993668>,
 'j’avais': <gensim.models.keyedvectors.Vocab at 0x7f170c9936a0>,
 'rejoint': <gensim.models.keyedvectors.Vocab at 0x7f170c9936d8>,
 'effort': <gensim.models.keyedvectors.Vocab at 0x7f170c993710>,
 'âge': <gensim.models.keyedvectors.Vocab at 0x7f170c993748>,
 'jamais': <gensim.models.keyedvectors.Vocab at 0x7f170c993780>,
 'vie': <gensim.models.keyedvectors.Vocab at 0x7f170c9937b8>,
 'primitive,': <gensim.models.keyedvectors.Vocab at 0x7f170c9937f0>,
 'telle': <gensim.models.keyedvectors.Vocab at 0x7f170c993828>,
 'celle': <gensim.models.keyedvectors.Vocab at 0x7f170c993860>,
 'grand-oncle': <gensim.models.keyedvectors.Vocab at 0x7f170c993898>,
 'boucles': <gensim.models.keyedvectors.Vocab at 0x7f170c9938d0>,
 'qu’avait': <gensim.models.keyedvectors.Vocab at 0x7f170c993908>,
 'dissipée': <gensim.models.keyedvectors.Vocab at 0x7f170c993940>,
 '—': <gensim.models.keyedvectors.Vocab at 0x7f170c993978>,
 'date': <gensim.models.keyedvectors.Vocab at 0x7f170c9939b0>,
 'nouvelle': <gensim.models.keyedvectors.Vocab at 0x7f170c9939e8>,
 'avait': <gensim.models.keyedvectors.Vocab at 0x7f170c993a20>,
 'J’avais': <gensim.models.keyedvectors.Vocab at 0x7f170c993a58>,
 'oublié': <gensim.models.keyedvectors.Vocab at 0x7f170c993a90>,
 'cet': <gensim.models.keyedvectors.Vocab at 0x7f170c993ac8>,
 'événement': <gensim.models.keyedvectors.Vocab at 0x7f170c993b00>,
 'sommeil,': <gensim.models.keyedvectors.Vocab at 0x7f170c993b38>,
 'j’en': <gensim.models.keyedvectors.Vocab at 0x7f170c993b70>,
 'retrouvais': <gensim.models.keyedvectors.Vocab at 0x7f170c993ba8>,
 'réussi': <gensim.models.keyedvectors.Vocab at 0x7f170c993be0>,
 'm’éveiller': <gensim.models.keyedvectors.Vocab at 0x7f170c993c18>,
 'échapper': <gensim.models.keyedvectors.Vocab at 0x7f170c993c50>,
 'grand-oncle,': <gensim.models.keyedvectors.Vocab at 0x7f170c993c88>,
 'mesure': <gensim.models.keyedvectors.Vocab at 0x7f170c993cc0>,
 'précaution': <gensim.models.keyedvectors.Vocab at 0x7f170c993cf8>,
 'complètement': <gensim.models.keyedvectors.Vocab at 0x7f170c993d30>,
 'tête': <gensim.models.keyedvectors.Vocab at 0x7f170c993d68>,
 'oreiller': <gensim.models.keyedvectors.Vocab at 0x7f170c993da0>,
 'avant': <gensim.models.keyedvectors.Vocab at 0x7f170c993dd8>,
 'retourner': <gensim.models.keyedvectors.Vocab at 0x7f170c993e10>,
 'monde': <gensim.models.keyedvectors.Vocab at 0x7f170c993e48>,
 'rêves': <gensim.models.keyedvectors.Vocab at 0x7f170c993e80>,
 'Quelquefois,': <gensim.models.keyedvectors.Vocab at 0x7f170c993eb8>,
 'côte': <gensim.models.keyedvectors.Vocab at 0x7f170c993ef0>,
 'femme': <gensim.models.keyedvectors.Vocab at 0x7f170c993f28>,
 'fausse': <gensim.models.keyedvectors.Vocab at 0x7f170c993f60>,
 'position': <gensim.models.keyedvectors.Vocab at 0x7f170c993f98>,
 'cuisse': <gensim.models.keyedvectors.Vocab at 0x7f170c993fd0>,
 'plaisir': <gensim.models.keyedvectors.Vocab at 0x7f170c996048>,
 'point': <gensim.models.keyedvectors.Vocab at 0x7f170c996080>,
 'goûter,': <gensim.models.keyedvectors.Vocab at 0x7f170c9960b8>,
 'c’était': <gensim.models.keyedvectors.Vocab at 0x7f170c9960f0>,
 'Mon': <gensim.models.keyedvectors.Vocab at 0x7f170c996128>,
 'corps': <gensim.models.keyedvectors.Vocab at 0x7f170c996160>,
 'sentait': <gensim.models.keyedvectors.Vocab at 0x7f170c996198>,
 'sien': <gensim.models.keyedvectors.Vocab at 0x7f170c9961d0>,
 'propre': <gensim.models.keyedvectors.Vocab at 0x7f170c996208>,
 'chaleur': <gensim.models.keyedvectors.Vocab at 0x7f170c996240>,
 'voulait': <gensim.models.keyedvectors.Vocab at 0x7f170c996278>,
 's’y': <gensim.models.keyedvectors.Vocab at 0x7f170c9962b0>,
 'rejoindre,': <gensim.models.keyedvectors.Vocab at 0x7f170c9962e8>,
 'reste': <gensim.models.keyedvectors.Vocab at 0x7f170c996320>,
 'humains': <gensim.models.keyedvectors.Vocab at 0x7f170c996358>,
 'm’apparaissait': <gensim.models.keyedvectors.Vocab at 0x7f170c996390>,
 'lointain': <gensim.models.keyedvectors.Vocab at 0x7f170c9963c8>,
 'auprès': <gensim.models.keyedvectors.Vocab at 0x7f170c996400>,
 'cette': <gensim.models.keyedvectors.Vocab at 0x7f170c996438>,
 'y': <gensim.models.keyedvectors.Vocab at 0x7f170c996470>,
 'moments': <gensim.models.keyedvectors.Vocab at 0x7f170c9964a8>,
 'joue': <gensim.models.keyedvectors.Vocab at 0x7f170c9964e0>,
 'chaude': <gensim.models.keyedvectors.Vocab at 0x7f170c996518>,
 'baiser,': <gensim.models.keyedvectors.Vocab at 0x7f170c996550>,
 'poids': <gensim.models.keyedvectors.Vocab at 0x7f170c996588>,
 'taille': <gensim.models.keyedvectors.Vocab at 0x7f170c9965c0>,
 'Si,': <gensim.models.keyedvectors.Vocab at 0x7f170c9965f8>,
 'arrivait': <gensim.models.keyedvectors.Vocab at 0x7f170c996630>,
 'quelquefois,': <gensim.models.keyedvectors.Vocab at 0x7f170c996668>,
 'traits': <gensim.models.keyedvectors.Vocab at 0x7f170c9966a0>,
 'connue': <gensim.models.keyedvectors.Vocab at 0x7f170c9966d8>,
 'vie,': <gensim.models.keyedvectors.Vocab at 0x7f170c996710>,
 'j’allais': <gensim.models.keyedvectors.Vocab at 0x7f170c996748>,
 'donner': <gensim.models.keyedvectors.Vocab at 0x7f170c996780>,
 'retrouver,': <gensim.models.keyedvectors.Vocab at 0x7f170c9967b8>,
 'ceux': <gensim.models.keyedvectors.Vocab at 0x7f170c9967f0>,
 'partent': <gensim.models.keyedvectors.Vocab at 0x7f170c996828>,
 'voir': <gensim.models.keyedvectors.Vocab at 0x7f170c996860>,
 'leurs': <gensim.models.keyedvectors.Vocab at 0x7f170c996898>,
 'cité': <gensim.models.keyedvectors.Vocab at 0x7f170c9968d0>,
 'désirée': <gensim.models.keyedvectors.Vocab at 0x7f170c996908>,
 's’imaginent': <gensim.models.keyedvectors.Vocab at 0x7f170c996940>,
 'qu’on': <gensim.models.keyedvectors.Vocab at 0x7f170c996978>,
 'peut': <gensim.models.keyedvectors.Vocab at 0x7f170c9969b0>,
 'réalité': <gensim.models.keyedvectors.Vocab at 0x7f170c9969e8>,
 'charme': <gensim.models.keyedvectors.Vocab at 0x7f170c996a20>,
 'songe': <gensim.models.keyedvectors.Vocab at 0x7f170c996a58>,
 'Peu': <gensim.models.keyedvectors.Vocab at 0x7f170c996a90>,
 'fille': <gensim.models.keyedvectors.Vocab at 0x7f170c996ac8>,
 'rêve': <gensim.models.keyedvectors.Vocab at 0x7f170c996b00>,
 'homme': <gensim.models.keyedvectors.Vocab at 0x7f170c996b38>,
 'dort': <gensim.models.keyedvectors.Vocab at 0x7f170c996b70>,
 'tient': <gensim.models.keyedvectors.Vocab at 0x7f170c996ba8>,
 'cercle': <gensim.models.keyedvectors.Vocab at 0x7f170c996be0>,
 'fil': <gensim.models.keyedvectors.Vocab at 0x7f170c996c18>,
 'heures,': <gensim.models.keyedvectors.Vocab at 0x7f170c996c50>,
 'l’ordre': <gensim.models.keyedvectors.Vocab at 0x7f170c996c88>,
 'années': <gensim.models.keyedvectors.Vocab at 0x7f170c996cc0>,
 'mondes': <gensim.models.keyedvectors.Vocab at 0x7f170c996cf8>,
 'Il': <gensim.models.keyedvectors.Vocab at 0x7f170c996d30>,
 'd’instinct': <gensim.models.keyedvectors.Vocab at 0x7f170c996d68>,
 'lit': <gensim.models.keyedvectors.Vocab at 0x7f170c996da0>,
 'seconde': <gensim.models.keyedvectors.Vocab at 0x7f170c996dd8>,
 'terre': <gensim.models.keyedvectors.Vocab at 0x7f170c996e10>,
 's’est': <gensim.models.keyedvectors.Vocab at 0x7f170c996e48>,
 'écoulé': <gensim.models.keyedvectors.Vocab at 0x7f170c996e80>,
 'jusqu’à': <gensim.models.keyedvectors.Vocab at 0x7f170c996eb8>,
 'rangs': <gensim.models.keyedvectors.Vocab at 0x7f170c996ef0>,
 'peuvent': <gensim.models.keyedvectors.Vocab at 0x7f170c996f28>,
 'rompre': <gensim.models.keyedvectors.Vocab at 0x7f170c996f60>,
 'Que': <gensim.models.keyedvectors.Vocab at 0x7f170c996f98>,
 'matin,': <gensim.models.keyedvectors.Vocab at 0x7f170c996fd0>,
 'quelque': <gensim.models.keyedvectors.Vocab at 0x7f170c99a048>,
 'prenne': <gensim.models.keyedvectors.Vocab at 0x7f170c99a080>,
 'train': <gensim.models.keyedvectors.Vocab at 0x7f170c99a0b8>,
 'trop': <gensim.models.keyedvectors.Vocab at 0x7f170c99a0f0>,
 'différente': <gensim.models.keyedvectors.Vocab at 0x7f170c99a128>,
 'habituellement,': <gensim.models.keyedvectors.Vocab at 0x7f170c99a160>,
 'suffit': <gensim.models.keyedvectors.Vocab at 0x7f170c99a198>,
 'bras': <gensim.models.keyedvectors.Vocab at 0x7f170c99a1d0>,
 'soulevé': <gensim.models.keyedvectors.Vocab at 0x7f170c99a208>,
 'arrêter': <gensim.models.keyedvectors.Vocab at 0x7f170c99a240>,
 'reculer': <gensim.models.keyedvectors.Vocab at 0x7f170c99a278>,
 'soleil,': <gensim.models.keyedvectors.Vocab at 0x7f170c99a2b0>,
 'première': <gensim.models.keyedvectors.Vocab at 0x7f170c99a2e8>,
 'minute': <gensim.models.keyedvectors.Vocab at 0x7f170c99a320>,
 'réveil,': <gensim.models.keyedvectors.Vocab at 0x7f170c99a358>,
 'saura': <gensim.models.keyedvectors.Vocab at 0x7f170c99a390>,
 'l’heure,': <gensim.models.keyedvectors.Vocab at 0x7f170c99a3c8>,
 's’il': <gensim.models.keyedvectors.Vocab at 0x7f170c99a400>,
 'exemple': <gensim.models.keyedvectors.Vocab at 0x7f170c99a438>,
 'dîner': <gensim.models.keyedvectors.Vocab at 0x7f170c99a470>,
 'assis': <gensim.models.keyedvectors.Vocab at 0x7f170c99a4a8>,
 'fauteuil,': <gensim.models.keyedvectors.Vocab at 0x7f170c99a4e0>,
 'alors': <gensim.models.keyedvectors.Vocab at 0x7f170c99a518>,
 'sera': <gensim.models.keyedvectors.Vocab at 0x7f170c99a550>,
 'complet': <gensim.models.keyedvectors.Vocab at 0x7f170c99a588>,
 'fauteuil': <gensim.models.keyedvectors.Vocab at 0x7f170c99a5c0>,
 'magique': <gensim.models.keyedvectors.Vocab at 0x7f170c99a5f8>,
 'fera': <gensim.models.keyedvectors.Vocab at 0x7f170c99a630>,
 'voyager': <gensim.models.keyedvectors.Vocab at 0x7f170c99a668>,
 'vitesse': <gensim.models.keyedvectors.Vocab at 0x7f170c99a6a0>,
 'l’espace,': <gensim.models.keyedvectors.Vocab at 0x7f170c99a6d8>,
 'au': <gensim.models.keyedvectors.Vocab at 0x7f170c99a710>,
 'paupières,': <gensim.models.keyedvectors.Vocab at 0x7f170c99a748>,
 'mois': <gensim.models.keyedvectors.Vocab at 0x7f170c99a780>,
 'tôt': <gensim.models.keyedvectors.Vocab at 0x7f170c99a7b8>,
 'autre': <gensim.models.keyedvectors.Vocab at 0x7f170c99a7f0>,
 'Mais': <gensim.models.keyedvectors.Vocab at 0x7f170c99a828>,
 'suffisait': <gensim.models.keyedvectors.Vocab at 0x7f170c99a860>,
 'que,': <gensim.models.keyedvectors.Vocab at 0x7f170c99a898>,
 'même,': <gensim.models.keyedvectors.Vocab at 0x7f170c99a8d0>,
 'fût': <gensim.models.keyedvectors.Vocab at 0x7f170c99a908>,
 'profond': <gensim.models.keyedvectors.Vocab at 0x7f170c99a940>,
 'entièrement': <gensim.models.keyedvectors.Vocab at 0x7f170c99a978>,
 'celui-ci': <gensim.models.keyedvectors.Vocab at 0x7f170c99a9b0>,
 'lâchait': <gensim.models.keyedvectors.Vocab at 0x7f170c99a9e8>,
 'plan': <gensim.models.keyedvectors.Vocab at 0x7f170c99aa20>,
 'lieu': <gensim.models.keyedvectors.Vocab at 0x7f170c99aa58>,
 'm’étais': <gensim.models.keyedvectors.Vocab at 0x7f170c99aa90>,
 'quand': <gensim.models.keyedvectors.Vocab at 0x7f170c99aac8>,
 'milieu': <gensim.models.keyedvectors.Vocab at 0x7f170c99ab00>,
 'j’ignorais': <gensim.models.keyedvectors.Vocab at 0x7f170c99ab38>,
 'trouvais,': <gensim.models.keyedvectors.Vocab at 0x7f170c99ab70>,
 'savais': <gensim.models.keyedvectors.Vocab at 0x7f170c99aba8>,
 'même': <gensim.models.keyedvectors.Vocab at 0x7f170c99abe0>,
 'premier': <gensim.models.keyedvectors.Vocab at 0x7f170c99ac18>,
 'instant': <gensim.models.keyedvectors.Vocab at 0x7f170c99ac50>,
 'seulement': <gensim.models.keyedvectors.Vocab at 0x7f170c99ac88>,
 'simplicité': <gensim.models.keyedvectors.Vocab at 0x7f170c99acc0>,
 'sentiment': <gensim.models.keyedvectors.Vocab at 0x7f170c99acf8>,
 'l’existence': <gensim.models.keyedvectors.Vocab at 0x7f170c99ad30>,
 'frémir': <gensim.models.keyedvectors.Vocab at 0x7f170c99ad68>,
 'fond': <gensim.models.keyedvectors.Vocab at 0x7f170c99ada0>,
 'dénué': <gensim.models.keyedvectors.Vocab at 0x7f170c99add8>,
 'l’homme': <gensim.models.keyedvectors.Vocab at 0x7f170c99ae10>,
 'non': <gensim.models.keyedvectors.Vocab at 0x7f170c99ae48>,
 'j’étais,': <gensim.models.keyedvectors.Vocab at 0x7f170c99ae80>,
 'quelques-uns': <gensim.models.keyedvectors.Vocab at 0x7f170c99aeb8>,
 'j’aurais': <gensim.models.keyedvectors.Vocab at 0x7f170c99aef0>,
 'pu': <gensim.models.keyedvectors.Vocab at 0x7f170c99af28>,
 'venait': <gensim.models.keyedvectors.Vocab at 0x7f170c99af60>,
 'd’en': <gensim.models.keyedvectors.Vocab at 0x7f170c99af98>,
 'haut': <gensim.models.keyedvectors.Vocab at 0x7f170c99afd0>,
 'tirer': <gensim.models.keyedvectors.Vocab at 0x7f170c99f048>,
 'néant': <gensim.models.keyedvectors.Vocab at 0x7f170c99f080>,
 'd’où': <gensim.models.keyedvectors.Vocab at 0x7f170c99f0b8>,
 'n’aurais': <gensim.models.keyedvectors.Vocab at 0x7f170c99f0f0>,
 'sortir': <gensim.models.keyedvectors.Vocab at 0x7f170c99f128>,
 'seul;': <gensim.models.keyedvectors.Vocab at 0x7f170c99f160>,
 'passais': <gensim.models.keyedvectors.Vocab at 0x7f170c99f198>,
 'par-dessus': <gensim.models.keyedvectors.Vocab at 0x7f170c99f1d0>,
 'siècles': <gensim.models.keyedvectors.Vocab at 0x7f170c99f208>,
 'l’image': <gensim.models.keyedvectors.Vocab at 0x7f170c99f240>,
 'confusément': <gensim.models.keyedvectors.Vocab at 0x7f170c99f278>,
 'entrevue': <gensim.models.keyedvectors.Vocab at 0x7f170c99f2b0>,
 'lampes': <gensim.models.keyedvectors.Vocab at 0x7f170c99f2e8>,
 'col': <gensim.models.keyedvectors.Vocab at 0x7f170c99f320>,
 'originaux': <gensim.models.keyedvectors.Vocab at 0x7f170c99f358>,
 'Peut-être': <gensim.models.keyedvectors.Vocab at 0x7f170c99f390>,
 'l’immobilité': <gensim.models.keyedvectors.Vocab at 0x7f170c99f3c8>,
 'choses': <gensim.models.keyedvectors.Vocab at 0x7f170c99f400>,
 'nous': <gensim.models.keyedvectors.Vocab at 0x7f170c99f438>,
 'leur': <gensim.models.keyedvectors.Vocab at 0x7f170c99f470>,
 'est-elle': <gensim.models.keyedvectors.Vocab at 0x7f170c99f4a8>,
 'imposée': <gensim.models.keyedvectors.Vocab at 0x7f170c99f4e0>,
 'certitude': <gensim.models.keyedvectors.Vocab at 0x7f170c99f518>,
 'elles': <gensim.models.keyedvectors.Vocab at 0x7f170c99f550>,
 'd’autres,': <gensim.models.keyedvectors.Vocab at 0x7f170c99f588>,
 'face': <gensim.models.keyedvectors.Vocab at 0x7f170c99f5c0>,
 'd’elles': <gensim.models.keyedvectors.Vocab at 0x7f170c99f5f8>,
 'Toujours': <gensim.models.keyedvectors.Vocab at 0x7f170c99f630>,
 'est-il': <gensim.models.keyedvectors.Vocab at 0x7f170c99f668>,
 'ainsi,': <gensim.models.keyedvectors.Vocab at 0x7f170c99f6a0>,
 'esprit': <gensim.models.keyedvectors.Vocab at 0x7f170c99f6d8>,
 'chercher,': <gensim.models.keyedvectors.Vocab at 0x7f170c99f710>,
 'réussir,': <gensim.models.keyedvectors.Vocab at 0x7f170c99f748>,
 'savoir': <gensim.models.keyedvectors.Vocab at 0x7f170c99f780>,
 'tournait': <gensim.models.keyedvectors.Vocab at 0x7f170c99f7b8>,
 'choses,': <gensim.models.keyedvectors.Vocab at 0x7f170c99f7f0>,
 'pays,': <gensim.models.keyedvectors.Vocab at 0x7f170c99f828>,
 'corps,': <gensim.models.keyedvectors.Vocab at 0x7f170c99f860>,
 'cherchait,': <gensim.models.keyedvectors.Vocab at 0x7f170c99f898>,
 'd’après': <gensim.models.keyedvectors.Vocab at 0x7f170c99f8d0>,
 'forme': <gensim.models.keyedvectors.Vocab at 0x7f170c99f908>,
 'fatigue,': <gensim.models.keyedvectors.Vocab at 0x7f170c99f940>,
 'ses': <gensim.models.keyedvectors.Vocab at 0x7f170c99f978>,
 'membres': <gensim.models.keyedvectors.Vocab at 0x7f170c99f9b0>,
 'direction': <gensim.models.keyedvectors.Vocab at 0x7f170c99f9e8>,
 'mur,': <gensim.models.keyedvectors.Vocab at 0x7f170c99fa20>,
 'place': <gensim.models.keyedvectors.Vocab at 0x7f170c99fa58>,
 'nommer': <gensim.models.keyedvectors.Vocab at 0x7f170c99fa90>,
 'demeure': <gensim.models.keyedvectors.Vocab at 0x7f170c99fac8>,
 'trouvait': <gensim.models.keyedvectors.Vocab at 0x7f170c99fb00>,
 'Sa': <gensim.models.keyedvectors.Vocab at 0x7f170c99fb38>,
 'mémoire,': <gensim.models.keyedvectors.Vocab at 0x7f170c99fb70>,
 'mémoire': <gensim.models.keyedvectors.Vocab at 0x7f170c99fba8>,
 'genoux,': <gensim.models.keyedvectors.Vocab at 0x7f170c99fbe0>,
 'épaules,': <gensim.models.keyedvectors.Vocab at 0x7f170c99fc18>,
 'présentait': <gensim.models.keyedvectors.Vocab at 0x7f170c99fc50>,
 'successivement': <gensim.models.keyedvectors.Vocab at 0x7f170c99fc88>,
 'plusieurs': <gensim.models.keyedvectors.Vocab at 0x7f170c99fcc0>,
 'chambres': <gensim.models.keyedvectors.Vocab at 0x7f170c99fcf8>,
 'dormi,': <gensim.models.keyedvectors.Vocab at 0x7f170c99fd30>,
 'tandis': <gensim.models.keyedvectors.Vocab at 0x7f170c99fd68>,
 'murs': <gensim.models.keyedvectors.Vocab at 0x7f170c99fda0>,
 'invisibles,': <gensim.models.keyedvectors.Vocab at 0x7f170c99fdd8>,
 'changeant': <gensim.models.keyedvectors.Vocab at 0x7f170c99fe10>,
 'selon': <gensim.models.keyedvectors.Vocab at 0x7f170c99fe48>,
 'pièce': <gensim.models.keyedvectors.Vocab at 0x7f170c99fe80>,
 'ténèbres': <gensim.models.keyedvectors.Vocab at 0x7f170c99feb8>,
 'pensée,': <gensim.models.keyedvectors.Vocab at 0x7f170c99fef0>,
 'hésitait': <gensim.models.keyedvectors.Vocab at 0x7f170c99ff28>,
 'seuil': <gensim.models.keyedvectors.Vocab at 0x7f170c99ff60>,
 'formes,': <gensim.models.keyedvectors.Vocab at 0x7f170c99ff98>,
 'eût': <gensim.models.keyedvectors.Vocab at 0x7f170c99ffd0>,
 'logis': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2048>,
 'rapprochant': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2080>,
 'circonstances,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a20b8>,
 'lui,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a20f0>,
 'rappelait': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2128>,
 'chacun': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2160>,
 'genre': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2198>,
 'lit,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a21d0>,
 'portes,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2208>,
 'prise': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2240>,
 'fenêtres,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2278>,
 'couloir,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a22b0>,
 'avec': <gensim.models.keyedvectors.Vocab at 0x7f170c9a22e8>,
 'réveil': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2320>,
 'cherchant': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2358>,
 'deviner': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2390>,
 'exemple,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a23c8>,
 'mur': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2400>,
 'grand': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2438>,
 'disais:': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2470>,
 '«Tiens,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a24a8>,
 'j’ai': <gensim.models.keyedvectors.Vocab at 0x7f170c9a24e0>,
 'fini': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2518>,
 'm’endormir': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2550>,
 'quoique': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2588>,
 'maman': <gensim.models.keyedvectors.Vocab at 0x7f170c9a25c0>,
 'soit': <gensim.models.keyedvectors.Vocab at 0x7f170c9a25f8>,
 'venue': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2630>,
 'dire': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2668>,
 'grand-père,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a26a0>,
 'mort': <gensim.models.keyedvectors.Vocab at 0x7f170c9a26d8>,
 'depuis': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2710>,
 'lequel': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2748>,
 'fidèles': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2780>,
 'passé': <gensim.models.keyedvectors.Vocab at 0x7f170c9a27b8>,
 'n’aurait': <gensim.models.keyedvectors.Vocab at 0x7f170c9a27f0>,
 'oublier,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2828>,
 'rappelaient': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2860>,
 'flamme': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2898>,
 'verre': <gensim.models.keyedvectors.Vocab at 0x7f170c9a28d0>,
 'suspendue': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2908>,
 'plafond': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2940>,
 'cheminée': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2978>,
 'marbre': <gensim.models.keyedvectors.Vocab at 0x7f170c9a29b0>,
 'chambre': <gensim.models.keyedvectors.Vocab at 0x7f170c9a29e8>,
 'Combray,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2a20>,
 'grands-parents,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2a58>,
 'jours': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2a90>,
 'lointains': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2ac8>,
 'qu’en': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2b00>,
 'figurais': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2b38>,
 'représenter': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2b70>,
 'exactement,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2ba8>,
 'reverrais': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2be0>,
 'mieux': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2c18>,
 'l’heure': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2c50>,
 'serais': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2c88>,
 'fait': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2cc0>,
 'éveillé': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2cf8>,
 'renaissait': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2d30>,
 'filait': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2d68>,
 'M^(me)': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2da0>,
 'Saint-Loup,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2dd8>,
 'Dieu!': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2e10>,
 'dix': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2e48>,
 'prolongé': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2e80>,
 'sieste': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2eb8>,
 'fais': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2ef0>,
 'tous': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2f28>,
 'soirs': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2f60>,
 'rentrant': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2f98>,
 'promenade': <gensim.models.keyedvectors.Vocab at 0x7f170c9a2fd0>,
 'habit': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8048>,
 'Car': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8080>,
 'ont': <gensim.models.keyedvectors.Vocab at 0x7f170c9a80b8>,
 'où,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a80f0>,
 'nos': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8128>,
 'retours': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8160>,
 'c’étaient': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8198>,
 'reflets': <gensim.models.keyedvectors.Vocab at 0x7f170c9a81d0>,
 'rouges': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8208>,
 'couchant': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8240>,
 'voyais': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8278>,
 'vitrage': <gensim.models.keyedvectors.Vocab at 0x7f170c9a82b0>,
 'fenêtre': <gensim.models.keyedvectors.Vocab at 0x7f170c9a82e8>,
 'mène': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8320>,
 'Tansonville,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8358>,
 'trouve': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8390>,
 'qu’à': <gensim.models.keyedvectors.Vocab at 0x7f170c9a83c8>,
 'suivre': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8400>,
 'clair': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8438>,
 'lune': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8470>,
 'chemins': <gensim.models.keyedvectors.Vocab at 0x7f170c9a84a8>,
 'jouais': <gensim.models.keyedvectors.Vocab at 0x7f170c9a84e0>,
 'jadis': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8518>,
 'soleil;': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8550>,
 'serai': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8588>,
 'endormi': <gensim.models.keyedvectors.Vocab at 0x7f170c9a85c0>,
 'm’habiller': <gensim.models.keyedvectors.Vocab at 0x7f170c9a85f8>,
 'dîner,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8630>,
 'loin': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8668>,
 'traversée': <gensim.models.keyedvectors.Vocab at 0x7f170c9a86a0>,
 'feux': <gensim.models.keyedvectors.Vocab at 0x7f170c9a86d8>,
 'lampe,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8710>,
 'seul': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8748>,
 'Ces': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8780>,
 'duraient': <gensim.models.keyedvectors.Vocab at 0x7f170c9a87b8>,
 'souvent': <gensim.models.keyedvectors.Vocab at 0x7f170c9a87f0>,
 'brève': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8828>,
 'incertitude': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8860>,
 'trouvais': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8898>,
 'distinguait': <gensim.models.keyedvectors.Vocab at 0x7f170c9a88d0>,
 'unes': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8908>,
 'autres': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8940>,
 'diverses': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8978>,
 'suppositions': <gensim.models.keyedvectors.Vocab at 0x7f170c9a89b0>,
 'faite,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a89e8>,
 'voyant': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8a20>,
 'cheval': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8a58>,
 'positions': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8a90>,
 'successives': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8ac8>,
 'revu': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8b00>,
 'tantôt': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8b38>,
 'l’une,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8b70>,
 'l’autre,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8ba8>,
 'finissais': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8be0>,
 'rappeler': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8c18>,
 'toutes': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8c50>,
 'longues': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8c88>,
 'rêveries': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8cc0>,
 'suivaient': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8cf8>,
 'd’hiver': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8d30>,
 'couché,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8d68>,
 'nid': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8da0>,
 'coin': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8dd8>,
 'couvertures,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8e10>,
 'bout': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8e48>,
 'bord': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8e80>,
 'numéro': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8eb8>,
 'Débats': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8ef0>,
 'roses,': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8f28>,
 'finit': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8f60>,
 'ensemble': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8f98>,
 'technique': <gensim.models.keyedvectors.Vocab at 0x7f170c9a8fd0>,
 'oiseaux': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae048>,
 'appuyant': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae080>,
 'glacial,': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae0b8>,
 'goûte': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae0f0>,
 'sentir': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae128>,
 'séparé': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae160>,
 'dehors': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae198>,
 '(comme': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae1d0>,
 'mer': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae208>,
 'feu': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae240>,
 'étant': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae278>,
 'entretenu': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae2b0>,
 'cheminée,': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae2e8>,
 'manteau': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae320>,
 'd’air': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae358>,
 'chaud': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae390>,
 'traversé': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae3c8>,
 'lueurs': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae400>,
 'sorte': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae438>,
 'creusée': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae470>,
 'sein': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae4a8>,
 'zone': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae4e0>,
 'ardente': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae518>,
 'mobile': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae550>,
 'souffles': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae588>,
 'figure': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae5c0>,
 'viennent': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae5f8>,
 'parties': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae630>,
 'voisines': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae668>,
 'éloignées': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae6a0>,
 'd’été': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae6d8>,
 'l’on': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae710>,
 'aime': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae748>,
 'uni': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae780>,
 'appuyé': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae7b8>,
 'volets': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae7f0>,
 'jette': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae828>,
 'jusqu’au': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae860>,
 'pied': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae898>,
 'échelle': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae8d0>,
 'presque': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae908>,
 'plein': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae940>,
 'air,': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae978>,
 'brise': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae9b0>,
 'pointe': <gensim.models.keyedvectors.Vocab at 0x7f170c9ae9e8>,
 'Louis': <gensim.models.keyedvectors.Vocab at 0x7f170c9aea20>,
 'gaie': <gensim.models.keyedvectors.Vocab at 0x7f170c9aea58>,
 'soir': <gensim.models.keyedvectors.Vocab at 0x7f170c9aea90>,
 'n’y': <gensim.models.keyedvectors.Vocab at 0x7f170c9aeac8>,
 'avais': <gensim.models.keyedvectors.Vocab at 0x7f170c9aeb00>,
 'malheureux,': <gensim.models.keyedvectors.Vocab at 0x7f170c9aeb38>,
 'soutenaient': <gensim.models.keyedvectors.Vocab at 0x7f170c9aeb70>,
 'légèrement': <gensim.models.keyedvectors.Vocab at 0x7f170c9aeba8>,
 'tant': <gensim.models.keyedvectors.Vocab at 0x7f170c9aebe0>,
 'montrer': <gensim.models.keyedvectors.Vocab at 0x7f170c9aec18>,
 'réserver': <gensim.models.keyedvectors.Vocab at 0x7f170c9aec50>,
 'lit;': <gensim.models.keyedvectors.Vocab at 0x7f170c9aec88>,
 'contraire': <gensim.models.keyedvectors.Vocab at 0x7f170c9aecc0>,
 'celle,': <gensim.models.keyedvectors.Vocab at 0x7f170c9aecf8>,
 'élevée': <gensim.models.keyedvectors.Vocab at 0x7f170c9aed30>,
 'hauteur': <gensim.models.keyedvectors.Vocab at 0x7f170c9aed68>,
 'deux': <gensim.models.keyedvectors.Vocab at 0x7f170c9aeda0>,
 'étages': <gensim.models.keyedvectors.Vocab at 0x7f170c9aedd8>,
 'partiellement': <gensim.models.keyedvectors.Vocab at 0x7f170c9aee10>,
 'dès': <gensim.models.keyedvectors.Vocab at 0x7f170c9aee48>,
 'seconde,': <gensim.models.keyedvectors.Vocab at 0x7f170c9aee80>,
 'l’odeur': <gensim.models.keyedvectors.Vocab at 0x7f170c9aeeb8>,
 'inconnue': <gensim.models.keyedvectors.Vocab at 0x7f170c9aeef0>,
 'convaincu': <gensim.models.keyedvectors.Vocab at 0x7f170c9aef28>,
 'l’hostilité': <gensim.models.keyedvectors.Vocab at 0x7f170c9aef60>,
 'rideaux': <gensim.models.keyedvectors.Vocab at 0x7f170c9aef98>,
 'violets': <gensim.models.keyedvectors.Vocab at 0x7f170c9aefd0>,
 'indifférence': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4048>,
 'pendule': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4080>,
 'n’eusse': <gensim.models.keyedvectors.Vocab at 0x7f170c9b40b8>,
 'là;': <gensim.models.keyedvectors.Vocab at 0x7f170c9b40f0>,
 'étrange': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4128>,
 'impitoyable': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4160>,
 'glace': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4198>,
 'pieds': <gensim.models.keyedvectors.Vocab at 0x7f170c9b41d0>,
 'obliquement': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4208>,
 'creusait': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4240>,
 'vif': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4278>,
 'plénitude': <gensim.models.keyedvectors.Vocab at 0x7f170c9b42b0>,
 'champ': <gensim.models.keyedvectors.Vocab at 0x7f170c9b42e8>,
 'accoutumé': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4320>,
 'heures': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4358>,
 'prendre': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4390>,
 'exactement': <gensim.models.keyedvectors.Vocab at 0x7f170c9b43c8>,
 'arriver': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4400>,
 'remplir': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4438>,
 'jusqu’en': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4470>,
 'gigantesque': <gensim.models.keyedvectors.Vocab at 0x7f170c9b44a8>,
 'souffert': <gensim.models.keyedvectors.Vocab at 0x7f170c9b44e0>,
 'dures': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4518>,
 'étendu': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4550>,
 'l’oreille': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4588>,
 'cœur': <gensim.models.keyedvectors.Vocab at 0x7f170c9b45c0>,
 'l’habitude': <gensim.models.keyedvectors.Vocab at 0x7f170c9b45f8>,
 'changé': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4630>,
 'couleur': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4668>,
 'rideaux,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b46a0>,
 'taire': <gensim.models.keyedvectors.Vocab at 0x7f170c9b46d8>,
 'enseigné': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4710>,
 'pitié': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4748>,
 'oblique': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4780>,
 'cruelle,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b47b8>,
 'sinon': <gensim.models.keyedvectors.Vocab at 0x7f170c9b47f0>,
 'chassé': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4828>,
 'complètement,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4860>,
 'diminué': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4898>,
 'apparente': <gensim.models.keyedvectors.Vocab at 0x7f170c9b48d0>,
 'habile': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4908>,
 'lente,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4940>,
 'commence': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4978>,
 'laisser': <gensim.models.keyedvectors.Vocab at 0x7f170c9b49b0>,
 'semaines': <gensim.models.keyedvectors.Vocab at 0x7f170c9b49e8>,
 'malgré': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4a20>,
 'heureux': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4a58>,
 'trouver,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4a90>,
 'car': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4ac8>,
 'réduit': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4b00>,
 'seuls': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4b38>,
 'serait': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4b70>,
 'impuissant': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4ba8>,
 'Certes,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4be0>,
 'dernière': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4c18>,
 'fois': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4c50>,
 'bon': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4c88>,
 'ange': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4cc0>,
 'arrêté': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4cf8>,
 'm’avait': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4d30>,
 'mis': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4d68>,
 'l’obscurité': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4da0>,
 'commode,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4dd8>,
 'bureau,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4e10>,
 'rue': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4e48>,
 'portes': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4e80>,
 'beau': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4eb8>,
 'demeures': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4ef0>,
 'l’ignorance': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4f28>,
 'présenté': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4f60>,
 'distincte,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4f98>,
 'croire': <gensim.models.keyedvectors.Vocab at 0x7f170c9b4fd0>,
 'présence': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7048>,
 'possible,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7080>,
 'donné': <gensim.models.keyedvectors.Vocab at 0x7f170c9b70b8>,
 'généralement': <gensim.models.keyedvectors.Vocab at 0x7f170c9b70f0>,
 'cherchais': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7128>,
 'grande': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7160>,
 'd’autrefois': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7198>,
 'grand’tante,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b71d0>,
 'Balbec,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7208>,
 'Paris,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7240>,
 'Doncières,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7278>,
 'Venise,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b72b0>,
 'ailleurs': <gensim.models.keyedvectors.Vocab at 0x7f170c9b72e8>,
 'encore,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7320>,
 'lieux,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7358>,
 'personnes': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7390>,
 'j’y': <gensim.models.keyedvectors.Vocab at 0x7f170c9b73c8>,
 'connues,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7400>,
 'vu': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7438>,
 'd’elles,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7470>,
 'm’en': <gensim.models.keyedvectors.Vocab at 0x7f170c9b74a8>,
 'raconté': <gensim.models.keyedvectors.Vocab at 0x7f170c9b74e0>,
 'fin': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7518>,
 'l’après-midi,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7550>,
 'longtemps': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7588>,
 'faudrait': <gensim.models.keyedvectors.Vocab at 0x7f170c9b75c0>,
 'mettre': <gensim.models.keyedvectors.Vocab at 0x7f170c9b75f8>,
 'rester,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7630>,
 'dormir,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7668>,
 'mère': <gensim.models.keyedvectors.Vocab at 0x7f170c9b76a0>,
 'grand’mère,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b76d8>,
 'redevenait': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7710>,
 'fixe': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7748>,
 'douloureux': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7780>,
 'préoccupations': <gensim.models.keyedvectors.Vocab at 0x7f170c9b77b8>,
 'On': <gensim.models.keyedvectors.Vocab at 0x7f170c9b77f0>,
 'distraire': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7828>,
 'l’air': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7860>,
 'lanterne': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7898>,
 'magique,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b78d0>,
 'dont,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7908>,
 'attendant': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7940>,
 'et,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7978>,
 'premiers': <gensim.models.keyedvectors.Vocab at 0x7f170c9b79b0>,
 'architectes': <gensim.models.keyedvectors.Vocab at 0x7f170c9b79e8>,
 'maîtres': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7a20>,
 'l’âge': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7a58>,
 'gothique,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7a90>,
 'substituait': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7ac8>,
 'apparitions': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7b00>,
 'vitrail': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7b38>,
 'momentané': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7b70>,
 'tristesse': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7ba8>,
 'n’en': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7be0>,
 'parce': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7c18>,
 'rien': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7c50>,
 'changement': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7c88>,
 'détruisait': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7cc0>,
 'quoi,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7cf8>,
 'sauf': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7d30>,
 'supplice': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7d68>,
 'coucher,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7da0>,
 'm’était': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7dd8>,
 'devenue': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7e10>,
 'supportable': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7e48>,
 'Maintenant': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7e80>,
 'reconnaissais': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7eb8>,
 'étais': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7ef0>,
 'inquiet,': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7f28>,
 'd’hôtel': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7f60>,
 'fusse': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7f98>,
 'arrivé': <gensim.models.keyedvectors.Vocab at 0x7f170c9b7fd0>,
 'descendant': <gensim.models.keyedvectors.Vocab at 0x7f17081e2048>,
 'fer': <gensim.models.keyedvectors.Vocab at 0x7f17081e2080>,
 'Au': <gensim.models.keyedvectors.Vocab at 0x7f17081e20b8>,
 'cheval,': <gensim.models.keyedvectors.Vocab at 0x7f17081e20f0>,
 'affreux': <gensim.models.keyedvectors.Vocab at 0x7f17081e2128>,
 'sortait': <gensim.models.keyedvectors.Vocab at 0x7f17081e2160>,
 'forêt': <gensim.models.keyedvectors.Vocab at 0x7f17081e2198>,
 'vert': <gensim.models.keyedvectors.Vocab at 0x7f17081e21d0>,
 'sombre': <gensim.models.keyedvectors.Vocab at 0x7f17081e2208>,
 'pente': <gensim.models.keyedvectors.Vocab at 0x7f17081e2240>,
 'colline,': <gensim.models.keyedvectors.Vocab at 0x7f17081e2278>,
 's’avançait': <gensim.models.keyedvectors.Vocab at 0x7f17081e22b0>,
 'château': <gensim.models.keyedvectors.Vocab at 0x7f17081e22e8>,
 'pauvre': <gensim.models.keyedvectors.Vocab at 0x7f17081e2320>,
 'Geneviève': <gensim.models.keyedvectors.Vocab at 0x7f17081e2358>,
 'Brabant': <gensim.models.keyedvectors.Vocab at 0x7f17081e2390>,
 'Ce': <gensim.models.keyedvectors.Vocab at 0x7f17081e23c8>,
 'coupé': <gensim.models.keyedvectors.Vocab at 0x7f17081e2400>,
 'ligne': <gensim.models.keyedvectors.Vocab at 0x7f17081e2438>,
 'courbe': <gensim.models.keyedvectors.Vocab at 0x7f17081e2470>,
 'guère': <gensim.models.keyedvectors.Vocab at 0x7f17081e24a8>,
 'limite': <gensim.models.keyedvectors.Vocab at 0x7f17081e24e0>,
 'glissait': <gensim.models.keyedvectors.Vocab at 0x7f17081e2518>,
 'entre': <gensim.models.keyedvectors.Vocab at 0x7f17081e2550>,
 'coulisses': <gensim.models.keyedvectors.Vocab at 0x7f17081e2588>,
 'qu’un': <gensim.models.keyedvectors.Vocab at 0x7f17081e25c0>,
 'pan': <gensim.models.keyedvectors.Vocab at 0x7f17081e25f8>,
 'château,': <gensim.models.keyedvectors.Vocab at 0x7f17081e2630>,
 'devant': <gensim.models.keyedvectors.Vocab at 0x7f17081e2668>,
 'rêvait': <gensim.models.keyedvectors.Vocab at 0x7f17081e26a0>,
 'portait': <gensim.models.keyedvectors.Vocab at 0x7f17081e26d8>,
 'ceinture': <gensim.models.keyedvectors.Vocab at 0x7f17081e2710>,
 'bleue': <gensim.models.keyedvectors.Vocab at 0x7f17081e2748>,
 'jaunes,': <gensim.models.keyedvectors.Vocab at 0x7f17081e2780>,
 'attendu': <gensim.models.keyedvectors.Vocab at 0x7f17081e27b8>,
 'connaître': <gensim.models.keyedvectors.Vocab at 0x7f17081e27f0>,
 'couleur,': <gensim.models.keyedvectors.Vocab at 0x7f17081e2828>,
 'car,': <gensim.models.keyedvectors.Vocab at 0x7f17081e2860>,
 'verres': <gensim.models.keyedvectors.Vocab at 0x7f17081e2898>,
 'sonorité': <gensim.models.keyedvectors.Vocab at 0x7f17081e28d0>,
 'l’avait': <gensim.models.keyedvectors.Vocab at 0x7f17081e2908>,
 'montrée': <gensim.models.keyedvectors.Vocab at 0x7f17081e2940>,
 'évidence': <gensim.models.keyedvectors.Vocab at 0x7f17081e2978>,
 'Golo': <gensim.models.keyedvectors.Vocab at 0x7f17081e29b0>,
 's’arrêtait': <gensim.models.keyedvectors.Vocab at 0x7f17081e29e8>,
 'écouter': <gensim.models.keyedvectors.Vocab at 0x7f17081e2a20>,
 'lu': <gensim.models.keyedvectors.Vocab at 0x7f17081e2a58>,
 'haute': <gensim.models.keyedvectors.Vocab at 0x7f17081e2a90>,
 'voix': <gensim.models.keyedvectors.Vocab at 0x7f17081e2ac8>,
 'comprendre': <gensim.models.keyedvectors.Vocab at 0x7f17081e2b00>,
 'parfaitement,': <gensim.models.keyedvectors.Vocab at 0x7f17081e2b38>,
 'n’excluait': <gensim.models.keyedvectors.Vocab at 0x7f17081e2b70>,
 'certaine': <gensim.models.keyedvectors.Vocab at 0x7f17081e2ba8>,
 'indications': <gensim.models.keyedvectors.Vocab at 0x7f17081e2be0>,
 's’éloignait': <gensim.models.keyedvectors.Vocab at 0x7f17081e2c18>,
 'lente': <gensim.models.keyedvectors.Vocab at 0x7f17081e2c50>,
 'Si': <gensim.models.keyedvectors.Vocab at 0x7f17081e2c88>,
 'distinguais': <gensim.models.keyedvectors.Vocab at 0x7f17081e2cc0>,
 'continuait': <gensim.models.keyedvectors.Vocab at 0x7f17081e2cf8>,
 's’avancer': <gensim.models.keyedvectors.Vocab at 0x7f17081e2d30>,
 'fenêtre,': <gensim.models.keyedvectors.Vocab at 0x7f17081e2d68>,
 'lui-même,': <gensim.models.keyedvectors.Vocab at 0x7f17081e2da0>,
 'essence': <gensim.models.keyedvectors.Vocab at 0x7f17081e2dd8>,
 'aussi': <gensim.models.keyedvectors.Vocab at 0x7f17081e2e10>,
 'surnaturelle': <gensim.models.keyedvectors.Vocab at 0x7f17081e2e48>,
 'celui': <gensim.models.keyedvectors.Vocab at 0x7f17081e2e80>,
 's’arrangeait': <gensim.models.keyedvectors.Vocab at 0x7f17081e2eb8>,
 'obstacle': <gensim.models.keyedvectors.Vocab at 0x7f17081e2ef0>,
 ...}
model.most_similar('madeleine')
/home/marcello/.local/lib/python3.7/site-packages/ipykernel_launcher.py:1: DeprecationWarning: Call to deprecated `most_similar` (Method will be removed in 4.0.0, use self.wv.most_similar() instead).
  """Entry point for launching an IPython kernel.
[('comédie', 0.9520220160484314),
 ('matinale', 0.9289866089820862),
 ('fraîcheur', 0.9280268549919128),
 ('déchéance', 0.9277333617210388),
 ('audience', 0.9262043833732605),
 ('direction,', 0.9231734275817871),
 ('cathédrale,', 0.9196324944496155),
 ('chaleur,', 0.918372631072998),
 ('tapisserie,', 0.9182485342025757),
 ('source', 0.9173475503921509)]
#un peu de paramètres: min_count= combien de fois un terme doit apparaître pour être mis dans le vocabulaire
model_sg = Word2Vec(sentences=sentences, min_count=1, sg=1, window=10)
model_sg.most_similar('madeleine')
/home/marcello/.local/lib/python3.7/site-packages/ipykernel_launcher.py:1: DeprecationWarning: Call to deprecated `most_similar` (Method will be removed in 4.0.0, use self.wv.most_similar() instead).
  """Entry point for launching an IPython kernel.
[('solitaire', 0.9496662616729736),
 ('décoration', 0.9432491064071655),
 ('magique,', 0.9428556561470032),
 ('Roussainville,', 0.9383882284164429),
 ('lecture,', 0.9381389021873474),
 ('tracé', 0.9377898573875427),
 ('tempête', 0.9358904957771301),
 ('boîte', 0.9357473850250244),
 ('blessure', 0.9343767166137695),
 ('fermée,', 0.9340600967407227)]

CBOW

Le principe est juste inversé: on prend en entrée le contexte pour deviner un mot cible.

Skip-Gram fonctionne bien avec de petits ensembles de données et peut mieux représenter les mots moins fréquents.

CBOW s’entraîne plus rapidement que Skip-Gram, et peut mieux représenter les mots plus fréquents.

model_cbow = Word2Vec(sentences=sentences, min_count=1, sg=0, window=10)
model_cbow.most_similar('madeleine')
/home/marcello/.local/lib/python3.7/site-packages/ipykernel_launcher.py:1: DeprecationWarning: Call to deprecated `most_similar` (Method will be removed in 4.0.0, use self.wv.most_similar() instead).
  """Entry point for launching an IPython kernel.
[('fenêtre', 0.9436046481132507),
 ('noire,', 0.9424415230751038),
 ('remplie', 0.9380182027816772),
 ('fenêtre,', 0.9375181794166565),
 ('voisine', 0.9338272213935852),
 ('étincelante', 0.9334274530410767),
 ('silhouette', 0.9321422576904297),
 ('servante,', 0.9317981004714966),
 ('pierre,', 0.9308982491493225),
 ('station', 0.9301747679710388)]
model.most_similar('temps')
/home/marcello/.local/lib/python3.7/site-packages/ipykernel_launcher.py:1: DeprecationWarning: Call to deprecated `most_similar` (Method will be removed in 4.0.0, use self.wv.most_similar() instead).
  """Entry point for launching an IPython kernel.
[('temps,', 0.8334174156188965),
 ('sens', 0.6833239197731018),
 ('train', 0.6688373684883118),
 ('chose', 0.6642275452613831),
 ('dehors', 0.6556645035743713),
 ('pays', 0.649366021156311),
 ('cas', 0.6439028978347778),
 ('souvenir', 0.6433770656585693),
 ('désir', 0.6371811032295227),
 ('plaisir', 0.6370058655738831)]

Glove

Article original

Au lieu que prendre en compte le vocabulaire comme input on part de la matrice des co-occurrences des mots. Donc GloVe (Global Vectors) prend en compte les statistiques d’occurences globales et non seulement locales (comme w2v).

Unlike the matrix factorization methods, the shallow window-based methods suffer from the disadvantage that they do not operate directly on the co-occurrence statistics of the corpus. Instead, these models scan context windows across the entire corpus, which fails to take advantage of the vast amount of repetition in the data.

Par exemple, admettons d’avoir la phrase:

La littérature est un art.

W2v sera capable de rendre compte du fait qu’il y a un lien entre “littérature” et “art”. Mais il ne saura pas profitter d’une autre information; combien de fois, dans l’ensemble du corpus, le mot “littérature” et le mot “art” se touvent ensemble?

GloVe ne se requiert pas un réseau de neurones. Les vecteurs sont déterminés sur la base de la probabilité de co-occurence des mots selon la formule:

\[ F(w_i, w_j, \tilde{w_k}) = \frac{Pi_k}{Pj_k} \]