Skip to content

Latest commit

 

History

History
129 lines (75 loc) · 12.4 KB

README.md

File metadata and controls

129 lines (75 loc) · 12.4 KB

Keyphrase Extraction Papers

⭐️⭐️⭐️Call For Cooperators⭐️⭐️⭐️

Organized by Mingyang Song (mingyang.song@bjtu.edu.cn).

LLM

  1. Is ChatGPT A Good Keyphrase Generator? A Preliminary Study, Mingyang Song, Haiyun Jiang, Shuming Shi, Songfang Yao, Shilong Lu, Yi Feng, Huafeng Liu, Liping Jing, ARXIV2023, Data.

Toolkit

  1. PKE: An Open Source Python-based Keyphrase Extraction Toolkit, Florian Boudin, COLING2016, Code.

  2. Self-supervised Contextual Keyword and Keyphrase Retrieval with Self-Labelling, Prafull Sharma and Yingbo Li, ArXiv2019, KeyBERT, Code.

Recent Popular Papers

Supervised Models

  1. Exploiting Topic-based Adversarial Neural Network for Cross-domain Keyphrase Extraction, Yanan Wang, Qi Liu, Chuan Qin, Tong Xu, Yijun Wang, Enhong Chen and Hui Xiong, ICDM2018, Code.

  2. Keyphrase Extraction Using Deep Recurrent Neural Networks on Twitter, Qi Zhang, Yang Wang, Yeyun Gong, Xuanjing Huang, EMNLP2016.

  3. Supervised Keyphrase Extraction as Positive Unlabeled Learning, Lucas Sterckx, Cornelia Caragea, Thomas Demeester, Chris Develder, EMNLP2016.

  4. Using Human Attention to Extract Keyphrase from Microblog Post, Yingyi Zhang, Chengzhi Zhang, ACL2019.

  5. Keyphrase Extraction from Scholarly Articles as Sequence Labeling using Contextualized Embeddings, Dhruva Sahrawat, Debanjan Mahata, Raymond Zhang, Mayank Kulkarni, Agniv Sharma, Rakesh Gosangi, Amanda Stent, Yaman Kumar, Rajiv Ratn Shah, Roger Zimmermann, ArXiv2019.

  6. Open Domain Web Keyphrase Extraction Beyond Language Modeling, Lee Xiong, Chuan Hu, Chenyan Xiong, Daniel Campos, Arnold Overwijk, EMNLP2019.

  7. Glocal: Incorporating Global Information in Local Convolution for Keyphrase Extraction, Animesh Prasad, Min-Yen Kan, NAACL2019.

  8. DivGraphPointer: A Graph Pointer Network for Extracting Diverse Keyphrases, Zhiqing Sun, Jian Tang, Pan Du, Zhi-Hong Deng, Jian-Yun Nie, SIGIR2019.

  9. Bi-LSTM-CRF Sequence Labeling for Keyphrase Extraction from Scholarly Documents, Rabah A. Al-Zaidy, Cornelia Caragea, C. Lee Giles, WWW2019.

  10. A Joint Learning Approach based on Self-Distillation for Keyphrase Extraction from Scientific Documents, Tuan Manh Lai, Trung Bui, Doo Soon Kim, Quan Hung Tran, COLING2020.

  11. Joint Keyphrase Chunking and Salience Ranking with BERT, Si Sun, Chenyan Xiong, Zhenghao Liu, Zhiyuan Liu, Jie Bao, ArXiv2020, Code.

  12. Keyphrase Extraction with Dynamic Graph Convolutional Networks and Diversified Inference, Haoyu Zhang, Dingkun Long, Guangwei Xu, Pengjun Xie, Fei Huang, Ji Wang, ArXiv2020.

  13. Keyphrase Extraction with Span-based Feature Representations, Funan Mu, Zhenting Yu, LiFeng Wang, Yequan Wang, Qingyu Yin, Yibo Sun, Liqun Liu, Teng Ma, Jing Tang, Xing Zhou, ArXiv2020.

  14. Incorporating Multimodal Information in Open-Domain Web Keyphrase Extraction, Yansen Wang et al., EMNLP2020, Code.

  15. Capturing Global Informativeness in Open Domain Keyphrase Extraction, Si Sun, Zhenghao Liu, Chenyan Xiong, Zhiyuan Liu, and Jie Bao, NLPCC2021, Code.

  16. Importance Estimation from Multiple Perspectives for Keyphrase Extraction, Mingyang Song, Liping Jing and Lin Xiao, EMNLP2021.

  17. Hyperbolic Relevance Matching for Neural Keyphrase Extraction, Mingyang Song, Yi Feng and Liping Jing, NAACL2022, Code.

Unsupervised Models

  1. Automatic Keyphrase Extraction via Topic Decomposition, Zhiyuan Liu, Wenyi Huang, Yabin Zheng and Maosong Sun, EMNLP2010.

  2. PositionRank: An Unsupervised Approach to Keyphrase Extraction from Scholarly Documents, Corina Florescu and Cornelia Caragea, ACL2017.

  3. SalienceRank: Efficient Keyphrase Extraction with Topic Modeling, Nedelina Teneva, Weiwei Cheng, ACL2017, Code.

  4. Simple Unsupervised Keyphrase Extraction using Sentence Embeddings, Kamil Bennani-Smires, Claudiu Musat, Andreaa Hossmann, Michael Baeriswyl, Martin Jaggi, CoNLL 2018, Code.

  5. WikiRank:Improving Keyphrase Extraction Based on Background Knowledge, Yang Yu, Vincent Ng, ArXiv2018.

  6. KeyGames: A Game Theoretic Approach to Automatic Keyphrase Extraction, Arnav Saxena, Mudit Mangal and Goonjan Jain, COLING2020(Outstanding Paper Award), Code.

  7. Scientific Keyphrase Identification and Classification by Pre-Trained Language Models Intermediate Task Transfer Learning, Seo Yeon Park, Cornelia Caragea, COLING2020.

  8. SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-Trained Language Model, YI SUN, HANGPING QIU, YU ZHENG, ZHONGWEI WANG, AND CHAORAN ZHANG, IEEE Access2020, Code.

  9. AttentionRank: Unsupervised keyphrase Extraction using Self and Cross Attentions, Haoran Ding and Xiao Luo, EMNLP2021, Code.

  10. Unsupervised Keyphrase Extraction by Jointly Modeling Local and Global Context, Xinnian Liang, Shuangzhi Wu, Mu Li, and Zhoujun Li, EMNLP2021, Code, Code for Chinese.

  11. MDERank: A Masked Document Embedding Rank Approach for Unsupervised Keyphrase Extraction, Linhan Zhang, Qian Chen, Wen Wang, Chong Deng, Shiliang Zhang, Bing Li, Wei Wang, Xin Cao, ArXiv2021.

  12. Extending Neural Keyword Extraction with TF-IDF tagset matching, Boshko Koloski et al., EACL2021.

  13. Multi-Document Keyphrase Extraction: A Literature Review and the First Dataset, Ori Shapira, Ramakanth Pasunuru, Ido Dagan, and Yael Amsterdamer, ArXiv2021.

  14. Enhancing Keyphrase Extraction from Academic Articles with their Reference Information, Chengzhi Zhang, Lei Zhao, Mengyuan Zhao, Yingyi Zhang, ArXiv2021, Code.

  15. Unsupervised Keyphrase Extraction via Interpretable Neural Networks, Rishabh Joshi, Vidhisha Balachandran, Emily Saldanha, Maria Glenski, Svitlana Volkova, Yulia Tsvetkov, ArXiv2022.

  16. AGRank: Augmented Graph-based Unsupervised Keyphrase Extraction, Haoran Ding and Xiao Luo, AACL2022.

  17. Improving Embedding-based Unsupervised Keyphrase Extraction by Incorporating Structural Information, Mingyang Song, Huafeng Liu, Yi Feng, Liping Jing, ACL2023.

  18. Unsupervised Keyphrase Extraction by Learning Neural Keyphrase Set Function, Mingyang Song, Haiyun Jiang, Lemao Liu, Shuming Shi, Liping Jing, ACL2023.

  19. Improving Diversity in Unsupervised Keyphrase Extraction with Determinantal Point Process, Mingyang Song, Huafeng Liu, Liping Jing, CIKM2023.

  20. HyperRank: Hyperbolic Ranking Model for Unsupervised Keyphrase Extraction, Mingyang Song, Huafeng Liu, Liping Jing, EMNLP2023.

  21. Mitigating Over-generation for Unsupervised Keyphrase Extraction with Heterogeneous Centrality Detection, Mingyang Song, Pengyu Xu, Yi Feng, Huafeng Liu, Liping Jing, EMNLP2023.

  22. PromptRank: Unsupervised Keyphrase Extraction Using Prompt, Aobo Kong, Shiwan Zhao, Hao Chen, Qicheng Li, Yong Qin, Ruiqi Sun, Xiaoyan Bai, ACL2023.

Survey

  1. Automatic Keyphrase Extraction: A Survey of the State of the Art, Kazi Saidul Hasan and Vincent Ng, ACL2014.
  2. A Review of Keyphrase Extraction, Eirini Papagiannopoulou and Grigorios Tsoumakas, ArXiv2019.
  3. A Survey on Recent Advances in Keyphrase Extraction from Pre-trained Language Models, Mingyang Song, Yi Feng and Liping Jing, EACL2023.

Dataset

ID Name Description Paper Conference
1 KP20k Scientific Paper Abstracts Deep Keyphrase Generation ACL2017
2 OpenKP Open Domain Open Domain Web Keyphrase Extraction Beyond Language Modeling EMNLP-IJCNLP2019
3 NUS Full-text Scientific Papers Keyphrase Extraction in Scientific Publications COLING2012-Demo
4 SemEval2010 Full-text Scientific Papers How Document Pre-processing affects Keyphrase Extraction Performance COLING2016-WorkShop
5 Krapivin Full-text Scientific Papers Large dataset for keyphrases extraction -
6 Inspec Scientific Paper Abstracts Improved automatic keyword extraction given more linguistic knowledge EMNLP2003
7 DUC2001 News Single Document Keyphrase Extraction Using Neighborhood Knowledge AAAI2008

The other descriptions (KeywordExtractor-Datasets, ake-datasets) of keyphrase extraction datasets.

Other Papers

Unified Supervised Models

  1. An Integrated Approach for Keyphrase Generation via Exploring the Power of Retrieval and Extraction, Wang Chen, Hou Pong Chan, Piji Li, Lidong Bing, Irwin King, NAACL2019, Code.

  2. Keyphrase Prediction With Pre-trained Language Model, Rui Liu, Zheng Lin, Weiping Wang, ArXiv2020.

  3. SenSeNet: Neural Keyphrase Generation with Document Structure, Yichao Luo, Zhengyan Li, Bingning Wang, Xiaoyu Xing, Qi Zhang, Xuanjing Huang, ArXiv2020.

  4. UniKeyphrase: A Unified Extraction and Generation Framework for Keyphrase Prediction, Huanqin Wu, Wei Liu, Lei Li, Dan Nie, Tao Chen, Feng Zhang, Di Wang, ACL2021, Code.