深度学习NLP方向论文列表--Sentence and Document Modeling

Sentence and Document Modeling

Phrase Modeling

  • Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg Corrado, and Jeff Dean, “Distributed Representations of Words and Phrases and their Compositionality,” pdf NIPS 2013
  • [socher’s]
  • [cutting RNN trees] Christian Scheible, Hinrich Schutze. “Cutting Recursive Autoencoder Trees” pdf CoRR abs/1301.2811 (2013)
  • [composition operators, mental-related similarity] Gershman, S. J., & Tenenbaum, J. B. “Phrase similarity in humans and machines”. pdf 2015 Proceedings of the 37th Annual Conference of the Cognitive Science Society, Proceedings of the 37th Annual Conference of the Cognitive Science Society.

Sentence Modeling

CNNs: convolution neural networks for sentence modeling

  • [convnet for sentences, dynamic, k-max pooling, stacked] Nal Kalchbrenner, Edward Grefenstette and Phil Blunsom. “A Convolutional Neural Network for Modelling Sentences” pdf ACL 2014.
  • [2D convolutional] Misha Denil, Alban Demiraj, Nal Kalchbrenner, Phil Blunsom, Nando de Freitas. “Modelling, Visualising and Summarising Documents with a Single Convolutional Neural Network” pdf in CoRR 2014.
  • [unsupervised pretraining for CNN] Wenpeng Yin and Hinrich Schutze. “Convolutional Neural Network for Paraphrase Identification.” pdf NAACL 2015
  • [convolute better with word order, parallel-CNN, different region] Rie Johnson and Tong Zhang. “Effective Use of Word Order for Text Categorization with Convolutional Neural Networks” pdf
  • Hermann, Karl Moritz, and Phil Blunsom. “Multilingual Models for Compositional
    Distributed Semantics.” pdf ACL 2014
  • Hermann, Karl Moritz, and Phil Blunsom. “Multilingual Distributed Representations
    without Word Alignment.” ACL 2014
  • Kim, Yoon. “Convolutional Neural Networks for Sentence Classification. “ arxiv : 2014
  • Le, Quoc V., and Tomas Mikolov. “Distributed Representations of Sentences and
    Documents.” ICML (2014).
  • [ARC-I, ARC-II, 2D convolutional, order perserving] Baotian Hu, Zhengdong Lu, Hang Li, etc. “Convolutional Neural Network Architectures for Matching Natural Language Sentences.” pdf NIPS 2014

RNNs and their variants

  • [RNN with GRUs] Ryan Kiros, Yukun Zhu, Ruslan Salakhutdinov, Richard S. Zemel, Antonio Torralba, Raquel Urtasun, Sanja Fidler, “Skip-Thought Vectors” pdf NIPS 2015

other NN architectures

  • [DAN, average, simple but effective] Mohit Iyyer, Varun Manjunatha, Jordan Boyd-Graber, and Hal Daumé III, “Deep Unordered Composition Rivals Syntactic Methods for Text Classification” pdf ACL 2015

Document Modeling

  • [2D convolutional] Misha Denil, Alban Demiraj, Nal Kalchbrenner, Phil Blunsom, Nando de Freitas. “Modelling, Visualising and Summarising Documents with a Single Convolutional Neural Network” pdf in CoRR 2014.
  • Hermann, Karl Moritz, and Phil Blunsom. “Multilingual Models for Compositional
    Distributed Semantics.” pdf ACL 2014
  • [deep RBM] Nitish Srivastava, Ruslan R Salakhutdinov, Geoffrey E. Hinton. “Modeling documents with a deep boltzmann machine.” pdf in Uncertainty in Artificial Intelligence, 2013
  • Chaochao Huang, Xipeng Qiu, Xuanjing Huang, “Text Classification with Document Embeddings” pdf Springer 2014
  • Le, Quoc V., and Tomas Mikolov. “Distributed Representations of Sentences and
    Documents.” ICML (2014).