ACL2021主会议论文汇总及分类

Original 刘聪NLP NLP工作站 2023-11-28

收录于合集 #论文集合 19个

往期推荐

难负例如何影响向量检索模型？

LM-MLC 一种基于完型填空的多标签分类算法

SIGIR2021论文：基于Text-to-Text多视图学习的段落重排序

SIGIR2021之DvBERT模型：双视图蒸馏的句向量BERT模型

SIGIR2021之IDCM模型: 文档内部级联选择段落服务于文档排序

Unilm对话生成之夸夸式闲聊机器人

超详细中文注释的GPT2新闻标题生成项目

ACL2021全部论文列表已经放出，为了以后更加方便地阅读论文，也本着一颗开源之心，花两个晚上的时间对主会议中的论文进行了分类整理，并附上了对应的论文链接。

主要包括10个分类，如下：（1）预训练语言模型及应用（58篇）；（2）表征学习（9篇）；（3）问答及检索（42篇）；（4）文本生成（29篇）；（5）摘要（23篇）；（6）小样本（16篇）；（7）对话（32篇）；（8）情感及情绪分析（15篇）；（9）信息抽取（60篇）；（10）其他（21篇）。

整理不易，请多多关注、转发、点赞。我们的口号是“生命不止，学习不停”，论文看起来吧。

一、预训练语言模型及应用

Long

（1）How Good is Your Tokenizer? On the Monolingual Performance ofMultilingual Language Models

https://arxiv.org/abs/2012.15613

（2）Meta-KD: A Meta Knowledge Distillation Framework for Language ModelCompression across Domains

https://arxiv.org/abs/2012.01266

（3）How is BERT surprised? Layerwise detection of linguistic anomalies

https://arxiv.org/abs/2105.07452

（4）Super Tickets in Pre-Trained Language Models: From Model Compressionto Improving Generalization

https://arxiv.org/abs/2105.12002

（5）R2D2: Recursive Transformer based on Differentiable Tree forInterpretable Hierarchical Language Modeling

（6）IrEne: Interpretable Energy Prediction for Transformers

https://arxiv.org/abs/2106.01199

（7）GhostBERT: Generate More Features with Cheap Operations for BERT

（8）Syntax-Enhanced Pre-trained Model

https://arxiv.org/abs/2012.14116

（9）PLOME: Pre-training with Misspelled Knowledge for Chinese SpellingCorrection

（10）EnsLM: Ensemble Language Model for Data Diversity by SemanticClustering

（11）StructFormer: Joint Unsupervised Induction of Dependency andConstituency Structure from Masked Language Modeling

https://arxiv.org/abs/2012.00857

（12）Convolutions and Self-Attention: Re-interpreting Relative Positionsin Pre-trained Language Models

https://arxiv.org/abs/2106.05505

（13）Implicit Representations of Meaning in Neural Language Model

https://arxiv.org/abs/2106.00737

（14）ERICA: Improving Entity and RelationUnderstanding for Pre-trained Language Models via Contrastive Learning

https://arxiv.org/abs/2012.15022

（15）Improving Formality Style Transfer with Context-Aware Rule Injection

https://arxiv.org/abs/2106.00210

（16）BinaryBERT: Pushing the Limit of BERT Quantization

https://arxiv.org/abs/2012.15701

（17）Shortformer: Better Language Modeling using Shorter Inputs

https://arxiv.org/abs/2012.15832

（18）Making Pre-trained Language Models Better Few-shot Learners

https://arxiv.org/abs/2012.15723

（19）ChineseBERT: Chinese Pretraining Enhanced by Glyph and PinyinInformation

（20）Are Pretrained Convolutions Better than Pretrained Transformers?

https://arxiv.org/abs/2105.03322

（21）ERNIE-Doc: A Retrospective Long-Document Modeling Transformer

https://arxiv.org/abs/2012.15688

（22）LeeBERT: Learned Early Exit for BERT with cross-level optimization

（23）Positional Artefacts Propagate Through Masked Language Model Embeddings

https://arxiv.org/abs/2011.04393

（24）Optimizing Deeper Transformers on Small Datasets

https://arxiv.org/abs/2012.15355

(25)When Do You Need Billionsof Words of Pretraining Data?

https://arxiv.org/abs/2011.04946

（26）Knowledgeable or Educated Guess? Revisiting Language Models asKnowledge Bases

https://arxiv.org/abs/2106.09231

（27）EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets

https://arxiv.org/abs/2101.00063

（28）SMedBERT: A Knowledge-Enhanced Pre-trained Language Model withStructured Semantics for Medical Text Mining

（29）Structural Guidance for Transformer Language Models

（30）MPC-BERT: A Pre-Trained Language Model for Multi-Party ConversationUnderstanding

https://arxiv.org/abs/2106.01541

（31）Language Model Evaluation Beyond Perplexity

https://arxiv.org/abs/2106.00085

（32）BERTGen: Multi-task Generation through BERT

https://arxiv.org/abs/2106.03484

（33）Pre-training Universal Language Representation

https://arxiv.org/abs/2105.14478

（34）Cascaded Head-colliding Attention

https://arxiv.org/abs/2105.14850

（35）Parameter-efficient Multi-task Fine-tuning for Transformers viaShared Hypernetworks

https://arxiv.org/abs/2106.04489

（36）Accelerating BERT Inference for Sequence Labeling via Early-Exit

https://arxiv.org/abs/2105.13878

（37）AutoTinyBERT: Automatic Hyper-parameter Optimization for EfficientPre-trained Language Models

（38）Lexicon Enhanced Chinese Sequence Labeling Using BERT Adapter

https://arxiv.org/abs/2105.07148

（39）On the Effectiveness of Adapter-based Tuning for Pretrained LanguageModel Adaptation

https://arxiv.org/abs/2106.03164

（40）Taming Pre-trained Language Models with N-gram Representations forLow-Resource Domain Adaptation

（41）Marginal Utility Diminishes: Exploring the Minimum Knowledge forBERT Knowledge Distillation

https://arxiv.org/abs/2106.05691

（42）Obtaining Better Static Word Embeddings UsingContextual Embedding Models

https://arxiv.org/abs/2106.04302

（43）Reflective Decoding: Beyond Unidirectional Generation withOff-the-Shelf Language Models

https://arxiv.org/abs/2010.08566

（44）Reservoir Transformers

https://arxiv.org/abs/2012.15045

（45）LexFit: Lexical Fine-Tuning of Pretrained Language Models

（46）Selecting Informative Contexts Improves Language Model Fine-tuning

https://arxiv.org/abs/2005.00175

（47）BERT is to NLP what AlexNet is to CV: Can Pre-Trained LanguageModels Identify Analogies?

https://arxiv.org/abs/2105.04949

（48）Examining the Inductive Bias of Neural Language Models withArtificial Languages

https://arxiv.org/abs/2106.01044

（49）An Empirical Study on Hyperparameter Optimization for Fine-TuningPre-trained Language Models

https://arxiv.org/abs/2106.09204

（50）BERTAC: Enhancing Transformer-based Language Models withAdversarially Pretrained Convolutional Neural Networks

（51）Enabling Lightweight Fine-tuning for Pre-trained Language ModelCompression based on Matrix Product Operators

https://arxiv.org/abs/2106.02205

（52）Length-Adaptive Transformer: Train Once with Length Drop, UseAnytime with Search

https://arxiv.org/abs/2010.07003

（53）H-Transformer-1D: Fast One-Dimensional Hierarchical Attention forSequences

Short

（54）Hi-Transformer: Hierarchical Interactive Transformer for Efficientand Effective Long Document Modeling

https://arxiv.org/abs/2106.01040

（55）Is Sparse Attention more Interpretable?

https://arxiv.org/abs/2106.01087

（56）Learning to Generate Task-Specific Adapters from Task Description

https://arxiv.org/abs/2101.00420

（57）Thank you BART! Rewarding Pre-Trained Models Improves FormalityStyle Transfer

https://arxiv.org/abs/2105.06947?context=cs

（58）Pre-training is a Hot Topic: Contextualized Document EmbeddingsImprove Topic Coherence

https://arxiv.org/abs/2004.03974

二、表征学习

Long

（1）DeCLUTR: Deep Contrastive Learning for Unsupervised TextualRepresentations

https://arxiv.org/abs/2006.03659（对比学习）

（2）Automated Concatenation of Embeddings for Structured Prediction

https://arxiv.org/abs/2010.05006

（3）Lightweight Cross-Lingual Sentence Representation Learning

https://arxiv.org/abs/2105.13856

（4）ConSERT: A Contrastive Framework for Self-Supervised SentenceRepresentation Transfer

https://arxiv.org/abs/2105.11741

（5）Dynamic Contextualized Word Embeddings

https://arxiv.org/abs/2010.12684

（6）Self-Guided Contrastive Learning for BERT Sentence Representations

https://arxiv.org/abs/2106.07345

（7）Bootstrapped Unsupervised Sentence Representation Learning

Short

（8）Attentive Multiview Text Representation for Differential Diagnosis

Short

（9）DefSent: Sentence Embeddings using Definition Sentences

https://arxiv.org/abs/2105.04339

三、问答及检索

Long

（1）Evaluating Evaluation Measures for Ordinal Classification andOrdinal Quantification

（2）Dual Reader-Parser on Hybrid Textual and Tabular Evidence for OpenDomain Question Answering

https://www.amazon.science/publications/dual-reader-parser-on-hybrid-textual-and-tabular-evidence-for-open-domain-question-answering

（3）Explanations for CommonsenseQA: New Dataset and Models

https://zenodo.org/record/4784281#.YNngJvkzZsY

（4）Answering Ambiguous Questions through Generative Evidence Fusion andRound-Trip Prediction

https://arxiv.org/abs/2011.13137

（5）Improving Document Representations by Generating Pseudo QueryEmbeddings for Dense Retrieval

https://arxiv.org/abs/2105.03599

（6）CoSQA: 20,000+ Web Queries for Code Search and Question Answering

https://arxiv.org/abs/2105.13239

（7）Coreference Reasoning in Machine Reading Comprehension

https://arxiv.org/pdf/2012.15573.pdf

（8）End-to-End Training of Neural Retrievers for Open-Domain QuestionAnswering

https://arxiv.org/abs/2101.00408

（9）Few-Shot Question Answering by Pretraining Span Selection

https://arxiv.org/abs/2101.00438

（10）Integrating Semantics and Neighborhood Information with Graph-DrivenGenerative Models for Document Retrieval

https://arxiv.org/abs/2105.13066

（11）Robustifying Multi-hop QA through Pseudo-Evidentiality Training

（12）Learning Dense Representations of Phrases at Scale

https://arxiv.org/abs/2012.12624

（13）Generation-Augmented Retrieval for Open-Domain Question Answering

https://arxiv.org/abs/2009.08553

（14）xMoCo: Cross Momentum Contrastive Learning for Open-Domain QuestionAnswering

（15）TAT-QA: A Question Answering Benchmark on a Hybrid of Tabular andTextual Content in Finance

https://arxiv.org/abs/2105.07624

（16）A Semantic-based Method for Unsupervised Commonsense QuestionAnswering

https://arxiv.org/abs/2105.14781

（17）A Neural Model for Joint Document and Snippet Ranking in QuestionAnswering for Large Document Collections

https://arxiv.org/abs/2106.08908

（18）Challenges in Information-Seeking QA: Unanswerable Questions andParagraph Retrieval

https://arxiv.org/abs/2010.11915

（19）Question Answering Over Temporal Knowledge Graphs

https://arxiv.org/abs/2106.01515

（20）Can Generative Pre-trained Language Models Serve as Knowledge Basesfor Closed-book QA?

https://arxiv.org/abs/2106.01561

（21）Article Reranking by Memory-Enhanced Key Sentence Matching forDetecting Previously Fact-Checked Claims

（22）UnitedQA: A Hybrid Approach for Open Domain Question Answering

https://arxiv.org/abs/2101.00178

（23）ForecastQA: A Question Answering Challenge for Event Forecastingwith Temporal Text Data

https://arxiv.org/abs/2005.00792

（24）On the Efficacy of Adversarial Data Collection for QuestionAnswering: Results from a Large-Scale Randomized Study

https://arxiv.org/abs/2106.00872

（25）Multi-task Retrieval for Knowledge-Intensive Tasks

https://arxiv.org/abs/2101.00117

（26）Joint Models for Answer Verification in Question Answering Systems

（27）Which Linguist Invented the Lightbulb? Presupposition Verificationfor Question-Answering

https://arxiv.org/abs/2101.00391

（28）Modeling Transitions of Focal Entities for Conversational KnowledgeBase Question Answering

（29）A Mutual Information Maximization Approach for the Spurious SolutionProblem in Weakly Supervised Question Answering

https://arxiv.org/abs/2106.07174

（30）Learn to Resolve Conversational Dependency: A Consistency TrainingFramework for Conversational Question Answering

（31）Learning to Perturb Word Embeddings for Out-of-distribution QA

https://arxiv.org/abs/2105.02692

Short

（32）The Curse of Dense Low-Dimensional Information Retrieval for LargeIndex Sizes

https://arxiv.org/abs/2012.14210

（33）DuReader_robust: A Chinese Dataset Towards Evaluating Robustness andGeneralization of Machine Reading Comprehension in Real-World Applications

https://arxiv.org/abs/2004.11142

（34）Towards a more Robust Evaluation for Conversational QuestionAnswering

（35）Training Adaptive Computation for Open-DomainQuestion Answering with Computational Constraints

（36）Efficient Passage Retrieval with Hashing for Open-domain QuestionAnswering

https://arxiv.org/abs/2106.00882

（37）Using Adversarial Attacks to Reveal the Statistical Bias in MachineReading Comprehension Models

https://arxiv.org/abs/2105.11136

（38）VAULT: VAriable Unified Long Text Representation for Machine ReadingComprehension

https://arxiv.org/abs/2105.03229

（39）Towards more equitable question answering systems: How much moredata do you need?

https://arxiv.org/abs/2105.14115

（40）A Semantics-aware Transformer Model of Relation Linking forKnowledge Base Question Answering

（41）Neural Retrieval for Question Answering with Cross-AttentionSupervised Data Augmentation

https://arxiv.org/abs/2009.13815

（42）Addressing Semantic Drift in Generative Question Answering withAuxiliary Extraction

四、文本生成

Long

（1）Generalising Multilingual Concept-to-Text NLG with Language AgnosticDelexicalisation

https://arxiv.org/abs/2105.03432

（2）Prefix-Tuning: Optimizing Continuous Prompts for Generation

https://arxiv.org/abs/2101.00190

（3）Polyjuice: Generating Counterfactuals for Explaining, Evaluating,and Improving Models

https://arxiv.org/abs/2101.00288

（4）Competence-based Multimodal Curriculum Learning for Medical ReportGeneration

（5）BACO: A Background Knowledge- and Content-Based Framework for CitingSentence Generation

（6）Mention Flags (MF): Constraining Transformer-based Text Generators

（7）Guiding the Growth: Difficulty-Controllable Question Generationthrough Step-by-Step Rewriting

https://arxiv.org/abs/2105.11698

（8）Improving Encoder by Auxiliary Supervision Tasks for Table-to-TextGeneration

（9）Writing by Memorizing:Hierarchical Retrieval-based Medical Report Generation

https://arxiv.org/abs/2106.06471

（10）Data Augmentation for TextGeneration Without Any Augmented Data

https://arxiv.org/abs/2105.13650

（11）Long Text Generation byModeling Sentence-Level and Discourse-Level Coherence

https://arxiv.org/abs/2105.08963

（12）PENS: A Dataset and GenericFramework for Personalized News Headline Generation

https://www.microsoft.com/en-us/research/uploads/prod/2021/06/ACL2021_PENS_Camera_Ready_1862_Paper.pdf

（13）De-Confounded VariationalEncoder-Decoder for Logical Table-to-Text Generation

（14）Bridging Subword Gaps inPretrain-Finetune Paradigm for Natural Language Generation

https://arxiv.org/abs/2106.06125

（15）Employing ArgumentationKnowledge Graphs for Neural Argument Generation

（16）Select, Extract andGenerate: Neural Keyphrase Generation with Layer-wise Coverage Attention

https://arxiv.org/abs/2008.01739

（17）DESCGEN: A DistantlySupervised Datasetfor Generating Entity Descriptions

https://arxiv.org/abs/2106.05365

（18）GTM: A Generative Triple-wiseModel for Conversational Question Generation

https://arxiv.org/abs/2106.03635

（19）All That’s ‘Human’ Is NotGold: Evaluating Human Evaluation of Generated Text

（20）A Hierarchical VAE forCalibrating Attributes while Generating Text using Normalizing Flow

（21）DYPLOC: Dynamic Planning ofContent Using Mixed Language Models for Text Generation

https://arxiv.org/abs/2106.00791

（22）Controllable Open-endedQuestion Generation with A New Question Type Ontology

https://web.eecs.umich.edu/~wangluxy/papers/ACL2021_cao_wang.pdf

（23）DExperts: Decoding-TimeControlled Text Generation with Experts and Anti-Experts

https://arxiv.org/abs/2105.03023

（24）Towards Table-to-TextGeneration with Numerical Reasoning

（25）TGEA: An Error-AnnotatedDataset and Benchmark Tasks for TextGeneration from Pretrained Language Models

Short

（26）On Training InstanceSelection for Few-Shot Neural Text Generation

（27）How Helpful is InverseReinforcement Learning for Table-to-Text Generation?

（28）QuestionGeneration for Adaptive Education

https://arxiv.org/abs/2106.04262

（29）Avoiding Overlap in DataAugmentation for AMR-to-Text Generation

五、摘要

Long

（1）Cross-Lingual Abstractive Summarization with Limited ParallelResources

https://arxiv.org/abs/2105.13648

（2）Unsupervised Extractive Summarization-Based Representations forAccurate and Explainable Collaborative Filtering

（3）Improving Factual Consistency of Abstractive Summarization viaQuestion Answering

https://arxiv.org/abs/2105.04623

（4）Long-Span Summarization via Local Attention and Content Selection

https://arxiv.org/abs/2105.03801

（5）RepSum: Unsupervised Dialogue Summarization based on ReplacementStrategy

（6）TWAG: A Topic-Guided Wikipedia Abstract Generator

（7）Language Model as an Annotator: Exploring DialoGPT for DialogueSummarization

https://arxiv.org/abs/2105.12544

（8）BASS: Boosting Abstractive Summarization with Unified Semantic Graph

https://arxiv.org/abs/2105.12041

（9）Focus Attention: Promoting Faithfulness and Diversity inSummarization

https://arxiv.org/abs/2105.11921

（10）Deep Differential Amplifier for Extractive Summarization

（11）Generating Query Focused Summaries from Query-Free Resources

https://arxiv.org/abs/2012.14774

（12）PASS: Perturb-and-Select Summarizer for Product Reviews

https://www.amazon.science/publications/pass-perturb-and-select-summarizer-for-product-reviews

（13）ConvoSumm: Conversation Summarization Benchmark and ImprovedAbstractive Summarization with Argument Mining

https://arxiv.org/abs/2106.00829

（14）Multi-TimeLine Summarization (MTLS): Improving TimelineSummarization by Generating Multiple Summaries

（15）EmailSum: Abstractive Email Thread Summarization

（16）Dissecting Generation Modes for Abstractive Summarization Models viaAblation and Attribution

https://arxiv.org/abs/2106.01518

（17）A Training-free and Reference-free Summarization Evaluation Metricvia Centrality-weighted Relevance and Self-referenced Redundancy

https://arxiv.org/abs/2106.13945

（18）Generating SOAP Notes from Doctor-Patient Conversations UsingModular Summarization Techniques

https://arxiv.org/abs/2005.01795

Short

（19）WikiSum: Coherent Summarization Dataset forEfficient Human-Evaluation

https://registry.opendata.aws/wikisum/

（20）Bringing Structure into Summaries: a Faceted Summarization Datasetfor Long Scientific Documents

https://arxiv.org/abs/2106.00130

（21）Reinforcement Learning for Abstractive Question Summarization withQuestion-aware Semantic Rewards

（22）Demoting the Lead Bias in News Summarization via AlternatingAdversarial Learning

https://arxiv.org/abs/2105.14241

（23）SimCLS: A Simple Framework for Contrastive Learning of AbstractiveSummarization

https://arxiv.org/abs/2106.01890

六、小样本学习

Long

（1）Meta-KD: A Meta Knowledge Distillation Framework for Language ModelCompression across Domains

https://arxiv.org/abs/2012.01266

（2）Multi-Label Few-Shot Learning for Aspect Category Detection

https://arxiv.org/abs/2105.14174

（3）ProtAugment: Intent Detection Meta-Learning through UnsupervisedDiverse Paraphrasing

https://arxiv.org/abs/2105.12995

（4）Few-Shot Text Ranking with Meta Adapted Synthetic Weak Supervision

https://arxiv.org/abs/2012.14862

（5）AugNLG: Few-shot Natural Language Generation using Self-trained DataAugmentation

https://arxiv.org/abs/2106.05589

（6）A Pre-training Strategy for Zero-Resource Response Selection inKnowledge-Grounded Conversations

（7）Evaluatingmorphological typology in zero-shot cross-lingual transfer

（8）LexiconLearning for Few Shot Sequence Modeling

https://arxiv.org/abs/2106.03993

（9）To POS Tagor Not to POS Tag: The Impact of POS Tags on Morphological Learning inLow-Resource Settings

（10）Meta-Learningto Compositionally Generalize

https://arxiv.org/abs/2106.04252

（11）RiskMinimization for Zero-shot Sequence Labeling

http://faculty.sist.shanghaitech.edu.cn/faculty/tukw/acl21rm.pdf

Short

（12）QA-DrivenZero-shot Slot Filling with Weak Supervision Pretraining

（13）Zero-shotFact Verification by Claim Generation

https://arxiv.org/abs/2105.14682

（14）DistinctLabel Representations for Few-Shot Text Classification

（15）Zero-shot Event Extraction via Transfer Learning: Challenges andInsights

（16）Issues with Entailment-based Zero-shot Text Classification

七、对话

Long

（1）TicketTalk: Toward human-level performance with end-to-end,transaction-based dialog systems

https://arxiv.org/abs/2012.12458

（2）SocAoG: Incremental Graph Parsing for Social Relation Inference inDialogues

https://arxiv.org/abs/2106.01006

（3）HERALD: An Annotation Efficient Method to Detect User Disengagementin Social Conversations

https://arxiv.org/abs/2106.00162

（4）Comprehensive Study: How the Context Information of DifferentGranularity Affects Dialogue State Tracking?

https://arxiv.org/abs/2105.03571

（5）Discovering Dialog Structure Graph for Coherent Dialog Generation

（6）Dialogue Response Selection with Hierarchical Curriculum Learning

https://arxiv.org/abs/2012.14756

（7）Diversifying Dialog Generation via Adaptive Label Smoothing

https://arxiv.org/abs/2105.14556

（8）BoB: BERT Over BERT for Training Persona-based Dialogue Models fromLimited Personalized Data

https://arxiv.org/abs/2106.06169

（9）I like fish, especially dolphins: Addressing Contradictions inDialogue Modeling

https://arxiv.org/abs/2012.13391

（10）Towards Quantifiable Dialogue Coherence Evaluation

https://arxiv.org/abs/2106.00507

（11）A Sequence-to-Sequence Approach to Dialogue State Tracking

https://arxiv.org/abs/2011.09553

（12）Dual Slot Selector via Local Reliability Verification for DialogueState Tracking

（13）Learning from Perturbations: Diverse and Informative DialogueGeneration with Inverse Adversarial Training

https://arxiv.org/abs/2105.15171

（14）Novel Slot Detection: A Benchmark for Discovering Unknown Slot Typesin the Task-Oriented Dialogue System

https://arxiv.org/abs/2105.14313

（15）RADDLE: An Evaluation Benchmark and Analysis Platform for RobustTask-oriented Dialog Systems

https://arxiv.org/abs/2012.14666

（16）Learning to Ask Conversational Questions by Optimizing LevenshteinDistance

（17）Conversations Are Not Flat: Modeling the Dynamic Information Flowacross Dialogue Utterances https://arxiv.org/abs/2106.02227

（18）Semantic Representation for Dialogue Modeling

https://arxiv.org/abs/2105.10188

（19）Towards Emotional Support Dialog Systems

https://arxiv.org/abs/2106.01144

（20）Discovering Dialogue Slots with Weak Supervision

（21）Structural Pre-training for Dialogue Comprehension

https://arxiv.org/abs/2105.10956

（22）Transferable Dialogue Systems and User Simulators

（23）Improving Dialog Systems for Negotiation with Personality Modeling

（24）TIMEDIAL: Temporal Commonsense Reasoning in Dialog

https://arxiv.org/abs/2106.04571

（25）Increasing Faithfulness in Knowledge-Grounded Dialogue withControllable Features

（26）GL-GIN: Fast and Accurate Non-Autoregressive Model for JointMultiple Intent Detection and Slot Filling

https://arxiv.org/abs/2106.01925

（27）DynaEval: Unifying Turn and Dialogue Level Evaluation

https://arxiv.org/abs/2106.01112

Short

（28）Saying No is An Art: Contextualized Fallback Responses forUnanswerable Dialogue Queries

https://arxiv.org/abs/2012.01873

（29）Preview, Attend and Review: Schema-Aware Curriculum Learning forMulti-Domain Dialogue State Tracking

https://arxiv.org/abs/2106.00291

（30）Continual Learning for Task-oriented Dialogue System with IterativeNetwork Pruning, Expanding and Masking

（31）Domain-Adaptive Pretraining Methods for Dialogue Understanding

https://arxiv.org/abs/2105.13665

（32）PRAL: A Tailored Pre-Training Model for Task-Oriented DialogGeneration

https://arxiv.org/abs/2004.13835

八、情感或情绪分析

Long

（1）Dual Graph Convolutional Networks for Aspect-based SentimentAnalysis

（2）Directed Acyclic Graph Network for Conversational EmotionRecognition

https://arxiv.org/abs/2105.12907

（3）DynaSent: A Dynamic Benchmark for Sentiment Analysis

https://arxiv.org/abs/2012.15349

（4）Position Bias Mitigation: A Knowledge-Aware Graph Model for EmotionCause Extraction

https://arxiv.org/abs/2106.03518

（5）Topic-Driven and Knowledge-Aware Transformer for Dialogue EmotionDetection

https://arxiv.org/abs/2106.01071

（6）Distributed Representations of Emotion Categories in Emotion Space

（7）DialogueCRN: Contextual Reasoning Networks for Emotion Recognitionin Conversations

https://arxiv.org/abs/2106.01978

（8）Missing Modality Imagination Network for Emotion Recognition withUncertain Missing Modalities

（9）A Unified Generative Framework for Aspect-based Sentiment Analysis

https://arxiv.org/abs/2106.04300

（10）Exploring the Efficacy of Automatically Generated Counterfactualsfor Sentiment Analysis

（11）Structured Sentiment Analysis as Dependency Graph Parsing

https://arxiv.org/abs/2105.14504

（12）Aspect-Category-Opinion-Sentiment Quadruple Extraction with ImplicitAspects and Opinions

Short

（13）Deep Context- and Relation-Aware Learning for Aspect-based SentimentAnalysis

https://arxiv.org/abs/2106.03806

（14）Towards Generative Aspect-Based Sentiment Analysis

（15）eMLM: A New Pre-training Objective for Emotion Related Tasks

九、信息抽取

Long

（1）Named Entity Recognition with Small Strongly Labeled and LargeWeakly Labeled Data

https://arxiv.org/abs/2106.08977

（2）Competence-based Multimodal Curriculum Learning for Medical ReportGeneration

https://arxiv.org/abs/2105.06804

（3）OntoED: Low-resource Event Detection with Ontology Embedding

https://arxiv.org/abs/2105.10922

（4）Subsequence Based Deep Active Learning for Named Entity Recognition

（5）BERTifying the Hidden Markov Model for Multi-Source WeaklySupervised Named Entity Recognition

https://arxiv.org/abs/2105.12848

（6）Knowledge-Enriched Event Causality Identification via LatentStructure Induction Networks

（7）Document-level Event Extraction via Heterogeneous Graph-basedInteraction Model with a Tracker

https://arxiv.org/abs/2105.14924

（8）A Large-Scale Chinese Multimodal NER Dataset with Speech Clues

（9）LearnDA: Learnable Knowledge-Guided Data Augmentation for EventCausality Identification

https://arxiv.org/abs/2106.01649

（10）CIL: Contrastive Instance Learning Framework for DistantlySupervised Relation Extraction

https://arxiv.org/abs/2106.10855

（11）Few-NERD: A Few-shot Named Entity Recognition Dataset

https://arxiv.org/abs/2105.07464

（12）SENT: Sentence-level Distant Relation Extraction via NegativeTraining

https://arxiv.org/abs/2106.11566?context=cs

（13）Modularized Interaction Network for Named Entity Recognition

（14）Capturing Event Argument Interaction via A Bi-DirectionalEntity-Level Recurrent Decoder

（15）A Span-Based Model for Joint Overlapped and Discontinuous NamedEntity Recognition

（16）An End-to-End Progressive Multi-Task Learning Framework for MedicalNamed Entity Recognition and Normalization

（17）MLBiNet: A Cross-Sentence Collective Event Detection Network

https://arxiv.org/abs/2105.09458

（18）PRGC: Potential Relation and Global Correspondence Based JointRelational Triple Extraction

https://arxiv.org/abs/2106.09895

（20）Improving Named Entity Recognition by External Context Retrievingand Cooperative Learning

https://arxiv.org/abs/2105.03654

（21）Leveraging Type Descriptions for Zero-shot Named Entity Recognitionand Classification

（22）Revisiting the Negative Data of Distantly Supervised RelationExtraction

https://arxiv.org/abs/2105.10158

（23）Learning from Miscellaneous Other-Class Words for Few-shot NamedEntity Recognition

（24）Joint Biomedical Entity and Relation Extraction withKnowledge-Enhanced Collective Inference

https://arxiv.org/abs/2105.13456

（25）Nested Named Entity Recognition via Explicitly Excluding theInfluence of the Best Path

（27）How Knowledge Graph and Attention Help? A Qualitative Analysis intoBag-level Relation Extraction

（28）From Discourse to Narrative: Knowledge Projection for Event RelationExtraction

https://arxiv.org/abs/2106.08629

（29）Fine-grained Information Extraction from Biomedical Literature basedon Knowledge-enriched Abstract Meaning Representation

（30）A Unified Generative Framework for Various NER Subtasks

https://arxiv.org/abs/2106.01223

（31）MECT: Multi-Metadata Embedding based Cross-Transformer for ChineseNamed Entity Recognition

（32）Unleash GPT-2 Power for Event Detection

（33）Trigger is Not Sufficient: Exploiting Frame-aware Knowledge forImplicit Event Argument Extraction

（34）Element Intervention for Open Relation Extraction

https://arxiv.org/abs/2106.09558

（35）Text2Event: Controllable Sequence-to-Structure Generation forEnd-to-end Event Extraction

https://arxiv.org/abs/2106.09232

（36）CLEVE: Contrastive Pre-training for Event Extraction

https://arxiv.org/abs/2105.14485

（37）MulDA: A Multilingual Data Augmentation Framework for Low-ResourceCross-Lingual NER

https://raihanjoty.github.io/papers/linlin-et-al-acl-21.html

（38）De-biasing Distantly Supervised Named Entity Recognition via CausalIntervention

https://arxiv.org/abs/2106.09233

（39）UniRE: A Unified Label Space for Entity Relation Extraction

（40）Crowdsourcing Learning as Domain Adaptation: A Case Study on NamedEntity Recognition

https://arxiv.org/abs/2105.14980

（41）Modeling Fine-Grained Entity Types with Box Embeddings

https://arxiv.org/abs/2101.00345

（42）CoRI: Collective Relation Integration with Data Augmentation forOpen Information Extraction

https://arxiv.org/abs/2106.00793

（43）CitationIE: Leveraging the Citation Graph for Scientific InformationExtraction

https://arxiv.org/abs/2106.01560

（44）Dependency-driven Relation Extraction with Attentive GraphConvolutional Networks

（45）Discontinuous Named Entity Recognition as Maximal Clique Discovery

https://arxiv.org/abs/2106.00218

（46）Weakly Supervised Named Entity Tagging with Learnable Logical Rules

（47）SpanNER: Named Entity Re-/Recognition as Span Prediction

https://arxiv.org/abs/2106.00641

（48）Refining Sample Embeddings with Relation Prototypes to EnhanceContinual Relation Extraction

https://www.researchgate.net/publication/352257560_Refining_Sample_Embeddings_with_Relation_Prototypes_to_Enhance_Continual_Relation_Extraction

（49）Document-level Event Extraction via Parallel Prediction Networks

（50）Learning Span-Level Interactions for Aspect Sentiment TripletExtraction

（51）The Possible, the Plausible, and the Desirable: Event-Based ModalityDetection for Language Processing

https://arxiv.org/abs/2106.08037

（52）A Neural Transition-based Joint Model for Disease Named EntityRecognition and Normalization

Short

（53）TIMERS: Document-level Temporal Relation Extraction

（54）ROPE: Reading Order Equivariant Positional Encoding for Graph-basedDocument Information Extraction

https://arxiv.org/abs/2106.10786

（55）Enhancing Entity Boundary Detection for Better Chinese Named EntityRecognition

（56）Entity Enhancement for Implicit Discourse Relation Classification inthe Biomedical Domain

（57）Entity Concept-enhanced Few-shot Relation Extraction

https://arxiv.org/abs/2106.02401

（58）Improving Model Generalization: A Chinese Named Entity RecognitionCase Study

（59）Explicitly Capturing Relations between Entity Mentions via GraphNeural Networks for Domain-specific Named Entity Recognition

（60）Three Sentences Are All You Need: Local Path Enhanced DocumentRelation Extraction

https://arxiv.org/abs/2106.01793

十、其他

（1）Semi-Supervised Text Classification with Balanced DeepRepresentation Distributions

（2）Defense against Adversarial Attacks in NLP via DirichletNeighborhood Ensemble

https://arxiv.org/abs/2006.11627

（3）Inter-GPS: Interpretable Geometry Problem Solving with FormalLanguage and Symbolic Reasoning

https://arxiv.org/abs/2105.04165

（4）Improving the Faithfulness of Attention-based Explanations withTask-specific Information for Text Classification

https://arxiv.org/abs/2105.02657

（5）Concept-Based Label Embedding via Dynamic Routing for HierarchicalText Classification

（6）Joint Verification and Reranking for Open Fact Checking Over Tables

https://arxiv.org/abs/2012.15115

（7）Structural Knowledge Distillation: Tractably Distilling Informationfor Structured Predictor

https://arxiv.org/abs/2010.05010

（8）UnNatural Language Inference

https://arxiv.org/abs/2101.00010

（9）OoMMix: Out-of-manifold Regularization in Contextual Embedding Spacefor Text Classification

https://arxiv.org/abs/2105.06750

（10）Database Reasoning Over Text

https://arxiv.org/abs/2106.01074

（11）Towards Robustness of Text-to-SQL Models against SynonymSubstitution

https://arxiv.org/abs/2106.01065

（12）Determinantal Beam Search

https://arxiv.org/abs/2106.07400

（13）POS-Constrained Parallel Decoding for Non-autoregressive Generation

（14）Hierarchy-aware Label Semantics Matching Network for HierarchicalText Classification

（15）Multi-View Cross-Lingual Structured Prediction with MinimumSupervision

http://faculty.sist.shanghaitech.edu.cn/faculty/tukw/acl21mv.pdf

（16）Chase: A Large-Scale and Pragmatic Chinese Dataset forCross-Database Context-Dependent Text-to-SQL

（17）Factoring Statutory Reasoning as Language Understanding Challenges

https://arxiv.org/abs/2105.07903

（18）HiddenCut: Simple Data Augmentation for Natural LanguageUnderstanding with Better Generalization

https://arxiv.org/abs/2106.00149

（19）KaggleDBQA: Realistic Evaluation of Text-to-SQL Parsers

https://arxiv.org/abs/2106.11455

（20）Automatic ICD Coding via Interactive Shared Representation Networkswith Self-distillation Mechanism

（21）Alignment Rationale for Natural Language Inference

由于本人涉及领域问题，没有对机器翻译、多模态、多语言等相关论文进行整理。如果有兴趣的朋友，可以整理后，私聊我，对该文进行增补。

原创不易，为爱发电。

知乎：刘聪NLP，文章同步在知乎更新，大家可以关注一波。

继续滑动看下一个

“家属和记者取得联系”：记者的退场意味深长

劲爆！为了姜萍两位女CEO互揭老底！

中石化一副总被曝出轨人妻，本人嚣张回应：旧情复燃尔

（待会删）大家低调浏览

又一女明星涉毒被判刑！自称为了“刺激大脑”创作，央视网发文痛批

ACL2021主会议论文汇总及分类

您可能也对以下帖子感兴趣

“家属和记者取得联系”：记者的退场意味深长

劲爆！为了姜萍两位女CEO互揭老底！

中石化一副总被曝出轨人妻，本人嚣张回应：旧情复燃尔

（待会删）大家低调浏览

又一女明星涉毒被判刑！自称为了“刺激大脑”创作，央视网发文痛批

生成图片，分享到微信朋友圈

ACL2021主会议论文汇总及分类

您可能也对以下帖子感兴趣