Zoom Q&A Session 1: Apr 21, (08:00-09:00 UTC)
Dialogue and Interactive Systems
- Jointly Improving Language Understanding and Generation with Quality-Weighted Weak Supervision of Automatic Labeling
- Conversational Question Answering over Knowledge Graphs with Transformer and Graph Attention Networks
- Augmenting Transformers with KNN-Based Composite Memory for Dialog
Zoom Q&A Session 2: Apr 21, (12:00-13:00 UTC)
Computational Social Choice and Social Media
- I Beg to Differ: A study of constructive disagreement in online conversations
- "Are you kidding me?": Detecting Unpalatable Questions on Reddit
- From the Stage to the Audience: Propaganda on Reddit
Natural Language Generation
- Civil Rephrases Of Toxic Texts With Self-Supervised Transformers
- Changing the Mind of Transformers for Topically-Controllable Language Generation
- Expanding, Retrieving and Infilling: Diversifying Cross-Domain Question Generation with Flexible Templates
Information Retrieval and Question Answering
- On the Calibration and Uncertainty of Neural Learning to Rank Models for Conversational Search
- Retrieval, Re-ranking and Multi-task Learning for Knowledge-Base Question Answering
- NoiseQA: Challenge Set Evaluation for User-Centric Question Answering
Gather Session 1: Apr 21, (13:00-15:00 UTC)
Information Extraction, Information Retrieval, Text Categorization and Question Answering
- On the Calibration and Uncertainty of Neural Learning to Rank Models for Conversational Search
- Retrieval, Re-ranking and Multi-task Learning for Knowledge-Base Question Answering
- CHOLAN: A Modular Approach for Neural Entity Linking on Wikipedia and Wikidata
- Predicting Treatment Outcome from Patient Texts: The Case of Internet-Based Cognitive Behavioural Therapy
- GRIT: Generative Role-filler Transformers for Document-level Event Entity Extraction
- TrNews: Heterogeneous User-Interest Transfer Learning for News Recommendation
- Adv-OLM: Generating Textual Adversaries via OLM
- DRAG: Director-Generator Language Modelling Framework for Non-Parallel Author Stylized Rewriting
- Leveraging Passage Retrieval with Generative Models for Open Domain Question Answering
- Clinical Outcome Prediction from Admission Notes using Self-Supervised Knowledge Integration
- Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets
- Dynamic Graph Transformer for Implicit Tag Recognition
- Bootstrapping Relation Extractors using Syntactic Search by Examples
- Exploiting Position and Contextual Word Embeddings for Keyphrase Extraction from Scientific Papers
- MultiHumES: Multilingual Humanitarian Dataset for Extractive Summarization
- Boosting Low-Resource Biomedical QA via Entity-Aware Masking Strategies
- Event-Driven News Stream Clustering using Entity-Aware Contextual Embeddings
- Complementary Evidence Identification in Open-Domain Question Answering
- NoiseQA: Challenge Set Evaluation for User-Centric Question Answering
- LESA: Linguistic Encapsulation and Semantic Amalgamation Based Generalised Claim Detection from Online Content
- Mega-COV: A Billion-Scale Dataset of 100+ Languages for COVID-19
- Supervised and Unsupervised Neural Approaches to Text Readability
Computational Social Choice, Social Media, Sentiment Analysis, Stylistic Analysis and Argument Mining
- Belief-based Generation of Argumentative Claims
- Implicitly Abusive Comparisons – A New Dataset and Linguistic Analysis
- Semantic Oppositeness Assisted Deep Contextual Modeling for Automatic Rumor Detection in Social Networks
- FakeFlow: Fake News Detection by Modeling the Flow of Affective Information
- Hierarchical Multi-head Attentive Network for Evidence-aware Fake News Detection
- SpanEmo: Casting Multi-label Emotion Classification as Span-prediction
- NewsMTSC: A Dataset for (Multi-)Target-dependent Sentiment Classification in Political News Articles
- Open-Mindedness and Style Coordination in Argumentative Discussions
- A New View of Multi-modal Language Analysis: Audio and Video Features as Text "Styles"
- Attention-based Relational Graph Convolutional Network for Target-Oriented Opinion Words Extraction
- I Beg to Differ: A study of constructive disagreement in online conversations
- "Are you kidding me?": Detecting Unpalatable Questions on Reddit
- Stereotype and Skew: Quantifying Gender Bias in Pre-trained and Fine-tuned Language Models
- Adversarial Learning of Poisson Factorisation Model for Gauging Brand Sentiment in User Reviews
- Adversarial Stylometry in the Wild: Transferable Lexical Substitution Attacks on Author Profiling
- PHASE: Learning Emotional Phase-aware Representations for Suicide Ideation Detection on Social Media
- A Few Topical Tweets are Enough for Effective User Stance Detection
- EmpathBERT: A BERT-based Framework for Demographic-aware Empathy Prediction
- Variational Weakly Supervised Sentiment Analysis with Posterior Regularization
- From the Stage to the Audience: Propaganda on Reddit
- Why Is MBTI Personality Detection from Texts a Difficult Task?
- Metrical Tagging in the Wild: Building and Annotating Poetry Corpora with Rhythmic Features
- Enhancing Aspect-level Sentiment Analysis with Word Dependencies
Dialogue and Interactive Systems, Natural Language Generation and Summarization
- Contrastive Multi-document Question Generation
- Recipes for Building an Open-Domain Chatbot
- Polarized-VAE: Proximity Based Disentangled Representation Learning for Text Generation
- Grounding as a Collaborative Process
- Does the Order of Training Samples Matter? Improving Neural Data-to-Text Generation with Curriculum Learning
- Dialogue Act-based Breakdown Detection in Negotiation Dialogues
- Neural Data-to-Text Generation with LM-based Text Augmentation
- Jointly Improving Language Understanding and Generation with Quality-Weighted Weak Supervision of Automatic Labeling
- Conversational Question Answering over Knowledge Graphs with Transformer and Graph Attention Networks
- Generating Syntactically Controlled Paraphrases without Using Annotated Parallel Pairs
- Analyzing the Forgetting Problem in Pretrain-Finetuning of Open-domain Dialogue Response Models
- Implicit Unlikelihood Training: Improving Neural Text Generation with Reinforcement Learning
- Civil Rephrases Of Toxic Texts With Self-Supervised Transformers
- Generating Weather Comments from Meteorological Simulations
- With Measured Words: Simple Sentence Selection for Black-Box Optimization of Sentence Compression Algorithms
- ChainCQG: Flow-Aware Conversational Question Generation
- Neural-Driven Search-Based Paraphrase Generation
- Changing the Mind of Transformers for Topically-Controllable Language Generation
- Expanding, Retrieving and Infilling: Diversifying Cross-Domain Question Generation with Flexible Templates
- Modeling Coreference Relations in Visual Dialog
- Don't Change Me! User-Controllable Selective Paraphrase Generation
- Augmenting Transformers with KNN-Based Composite Memory for Dialog
Machine Translation, Language Resources and Evaluation
- Does She Wink or Does She Nod? A Challenging Benchmark for Evaluating Word Understanding of Language Models
- CTC-based Compression for Direct Speech Translation
- TDMSci: A Specialized Corpus for Scientific Literature Entity Tagging of Tasks Datasets and Metrics
- Continuous Learning in Neural Machine Translation using Bilingual Dictionaries
- Multilingual Machine Translation: Closing the Gap between Shared and Language-specific Encoder-Decoders
- Clustering Word Embeddings with Self-Organizing Maps. Application on LaRoSeDa - A Large Romanian Sentiment Data Set
- Few-shot learning through contextual data augmentation
- SICK-NL: A Dataset for Dutch Natural Language Inference
- Understanding Pre-Editing for Black-Box Neural Machine Translation
- WiC-TSV: An Evaluation Benchmark for Target Sense Verification of Words in Context
- Alignment verification to improve NMT translation towards highly inflectional languages with limited resources
- Word Alignment by Fine-tuning Embeddings on Parallel Corpora
- Machine Translationese: Effects of Algorithmic Bias on Linguistic Complexity in Machine Translation
- Context-aware Neural Machine Translation with Mini-batch Embedding
- Streaming Models for Joint Speech Recognition and Translation
- 'Just because you are right, doesn't mean I am wrong': Overcoming a bottleneck in development and evaluation of Open-Ended VQA tasks
- CLiMP: A Benchmark for Chinese Language Model Evaluation
- MTOP: A Comprehensive Multilingual Task-Oriented Semantic Parsing Benchmark
- Recipes for Adapting Pre-trained Monolingual and Multilingual Models to Machine Translation
- A Large-scale Evaluation of Neural Machine Transliteration for Indic Languages
- Communicative-Function-Based Sentence Classification for Construction of an Academic Formulaic Expression Database
- Adaptation of Back-translation to Automatic Post-Editing for Synthetic Data Generation
Lexical Semantics, Sentence-Level Semantics, and Natural Language Grounding
- On the (In)Effectiveness of Images for Text Classification
- Dictionary-based Debiasing of Pre-trained Word Embeddings
- FEWS: Large-Scale, Low-Shot Word Sense Disambiguation with the Dictionary
- Elastic weight consolidation for better bias inoculation
- Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation
- Debiasing Pre-trained Contextualised Embeddings
- Language Models for Lexical Inference in Context
- On the Evaluation of Vision-and-Language Navigation Instructions
- Cross-lingual Visual Pre-training for Multimodal Machine Translation
- RelWalk - A Latent Variable Model Approach to Knowledge Graph Embedding
- The Interplay of Task Success and Dialogue Quality: An in-depth Evaluation in Task-Oriented Visual Dialogues
- A Unified Feature Representation for Lexical Connotations
- L2C: Describing Visual Differences Needs Semantic Understanding of Individuals
- Structural Encoding and Pre-training Matter: Adapting BERT for Table-Based Fact Verification
- Exploiting Definitions for Frame Identification
- Handling Out-Of-Vocabulary Problem in Hangeul Word Embeddings
- Exploiting Multimodal Reinforcement Learning for Simultaneous Machine Translation
- STAR: Cross-modal [STA]tement [R]epresentation for selecting relevant mathematical premises
- Framing Word Sense Disambiguation as a Multi-Label Problem for Model-Agnostic Knowledge Integration
- Increasing Robustness to Spurious Correlations using Forgettable Examples
- On Robustness of Neural Semantic Parsers
- The Chinese Remainder Theorem for Compact, Task-Precise, Efficient and Secure Word Embeddings
- Lifelong Knowledge-Enriched Social Event Representation Learning
Demos
- A Dashboard for Mitigating the COVID-19 Misinfodemic
- A description and demonstration of SAFAR framework
- Breaking Writer's Block: Low-cost Fine-tuning of Natural Language Generation Models
- COCO-EX: A Tool for Linking Concepts from Texts to ConceptNet
- Domain Expert Platform for Goal-Oriented Dialog Collection
- EasyTurk: A User-Friendly Interface for High-Quality Linguistic Annotation with Amazon Mechanical Turk
- European Language Grid: A Joint Platform for the European Language Technology Community
- Finite-state script normalization and processing utilities: The Nisaba Brahmic library
- HULK: An Energy Efficiency Benchmark Platform for Responsible Natural Language Processing
- InterpreT: An Interactive Visualization Tool for Interpreting Transformers
- LOME: Large Ontology Multilingual Extraction
- MadDog: A Web-based System for Acronym Identification and Disambiguation
- Massive Choice, Ample Tasks (MaChAmp): A Toolkit for Multi-task Learning in NLP
- MATILDA - Multi-AnnoTator multi-language Interactive Light-weight Dialogue Annotator
- OCTIS: Comparing and Optimizing Topic models is Simple!
- OPUS-CAT: Desktop NMT with CAT integration and local fine-tuning
- Paladin: an annotation tool based on active and proactive learning
- SF-QA: Simple and Fair Evaluation Library for Open-domain Question Answering
- Story Centaur: Large Language Model Few Shot Learning as a Creative Writing Tool
- T-NER: An All-Round Python Library for Transformer-based Named Entity Recognition
Zoom Q&A Session 3: Apr 22, (07:00-08:00 UTC)
Discourse and Pragmatics
- Joint Coreference Resolution and Character Linking for Multiparty Conversation
- Top-down Discourse Parsing via Sequence Labelling
- Rethinking Coherence Modeling: Synthetic vs. Downstream Tasks
Information Extraction
- Identifying Named Entities as they are Typed
- Language Models as Knowledge Bases: On Entity Representations, Storage Capacity, and Paraphrased Queries
- An End-to-end Model for Entity-level Relation Extraction using Multi-instance Learning
Zoom Q&A Session 4: Apr 22, (08:00-09:00 UTC)
Natural Language Grounding
- ECOL-R: Encouraging Copying in Novel Object Captioning with Reinforcement Learning
- An Empirical Study on the Generalization Power of Neural Representations Learned via Visual Guessing Games
- Crisscrossed Captions: Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO
Zoom Q&A Session 5: Apr 22, (12:00-13:00 UTC)
Dialogue and Interactive Systems
- Zero-shot Generalization in Dialog State Tracking through Generative Question Answering
- MIDAS: A Dialog Act Annotation Scheme for Open Domain HumanMachine Spoken Conversations
- Alternating Recurrent Dialog Model with Large-scale Pre-trained Language Models
Document Analysis and Text Classification
- Adaptive Mixed Component LDA for Low Resource Topic Modeling
- Hidden Biases in Unreliable News Detection Datasets
- ProFormer: Towards On-Device LSH Projection Based Transformers
Linguistic Theories, Cognitive Modeling and Psycholinguistics
- Disambiguatory Signals are Stronger in Word-initial Positions
- Deep Subjecthood: Higher-Order Grammatical Features in Multilingual BERT
- Probing for idiomaticity in vector space models
Sentiment Analysis, Stylistic Analysis and Argument Mining
- Learning From Revisions: Quality Assessment of Claims in Argumentation at Scale
- "Laughing at you or with you": The Role of Sarcasm in Shaping the Disagreement Space
- Challenges in Automated Debiasing for Toxic Language Detection
Student Research Workshop
- A Computational Analysis of Vagueness in Revisions of Instructional Texts
- Familiar words but strange voices: Modelling the influence of speech variability on word recognition
- Towards Personalised and Document-level Machine Translation of Dialogue
- TMR: Evaluating NER Recall on Tough Mentions
Gather Session 2: Apr 22, (13:00-15:00 UTC)
Information Extraction, Information Retrieval, Text Categorization and Question Answering
- Unification-based Reconstruction of Multi-hop Explanations for Science Questions
- Knowledge Base Question Answering through Recursive Hypergraphs
- Identifying Named Entities as they are Typed
- Cross-lingual Contextualized Topic Models with Zero-shot Learning
- Language Models as Knowledge Bases: On Entity Representations, Storage Capacity, and Paraphrased Queries
- How Certain is Your Transformer?
- Learning Relatedness between Types with Prototypes for Relation Extraction
- Probing into the Root: A Dataset for Reason Extraction of Structural Events from Financial Documents
- FAST: Financial News and Tweet Based Time Aware Network for Stock Trading
- Adaptive Mixed Component LDA for Low Resource Topic Modeling
- Hidden Biases in Unreliable News Detection Datasets
- LSOIE: A Large-Scale Dataset for Supervised Open Information Extraction
- Progressively Pretrained Dense Corpus Index for Open-Domain Question Answering
- Exploring the Limits of Few-Shot Link Prediction in Knowledge Graphs
- ProFormer: Towards On-Device LSH Projection Based Transformers
- ENPAR: Enhancing Entity and Entity Pair Representations for Joint Entity Relation Extraction
- Representations for Question Answering from Documents with Tables and Text
- Modeling Context in Answer Sentence Selection Systems on a Latency Budget
- Do Multi-Hop Question Answering Systems Know How to Answer the Single-Hop Sub-Questions?
- One-class Text Classification with Multi-modal Deep Support Vector Data Description
- Regulatory Compliance through Doc2Doc Information Retrieval: A case study in EU/UK legislation where text similarity has limitations
- An End-to-end Model for Entity-level Relation Extraction using Multi-instance Learning
Computational Social Choice, Social Media, Sentiment Analysis, Stylistic Analysis and Argument Mining
- If you've got it, flaunt it: Making the most of fine-grained sentiment annotations
- ResPer: Computationally Modelling Resisting Strategies in Persuasive Conversations
- CD^2CR: Co-reference resolution across documents and domains
- Joint Coreference Resolution and Character Linking for Multiparty Conversation
- Improving Factual Consistency Between a Response and Persona Facts
- Top-down Discourse Parsing via Sequence Labelling
- Ellipsis Resolution as Question Answering: An Evaluation
- Mode Effects' Challenge to Authorship Attribution
- Automatic Data Acquisition for Event Coreference Resolution
- An Expert Annotated Dataset for the Detection of Online Misogyny
- Cross-Topic Rumor Detection using Topic-Mixtures
- Learning From Revisions: Quality Assessment of Claims in Argumentation at Scale
- What Sounds "Right" to Me? Experiential Factors in the Perception of Political Ideology
- Us vs. Them: A Dataset of Populist Attitudes, News Bias and Emotions
- "Laughing at you or with you": The Role of Sarcasm in Shaping the Disagreement Space
- Content-based Models of Quotation
- Scientific Discourse Tagging for Evidence Extraction
- From Toxicity in Online Comments to Incivility in American News: Proceed with Caution
- BERTective: Language Models and Contextual Information for Deception Detection
- Gender and Racial Fairness in Depression Research using Social Media
- Challenges in Automated Debiasing for Toxic Language Detection
- Rethinking Coherence Modeling: Synthetic vs. Downstream Tasks
- Is the Understanding of Explicit Discourse Relations Required in Machine Reading Comprehension?
Morphology and Syntax, Linguistic and Cognitive Modeling, Interpretability and Analysis
- "Talk to me with left, right, and angles": Lexical entrainment in spoken Hebrew dialogue
- A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
- Self-Training Pre-Trained Language Models for Zero- and Few-Shot Multi-Dialectal Arabic Sequence Labeling
- Coordinate Constructions in English Enhanced Universal Dependencies: Analysis and Computational Modeling
- Leveraging End-to-End ASR for Endangered Language Documentation: An Empirical Study on Yolóxochitl Mixtec
- Data Augmentation for Voice-Assistant NLU using BERT-based Interchangeable Rephrase
- Language Modelling as a Multi-Task Problem
- On the evolution of syntactic information encoded by BERT's contextualized representations
- Calculating the optimal step of arc-eager parsing for non-projective trees
- Subword Pooling Makes a Difference
- VoiSeR: A New Benchmark for Voice-Based Search Refinement
- On the Computational Modelling of Michif Verbal Morphology
- On Hallucination and Predictive Uncertainty in Conditional Language Generation
- Measuring and Improving Faithfulness of Attention in Neural Machine Translation
- Segmenting Subtitles for Correcting ASR Segmentation Errors
- PPT: Parsimonious Parser Transfer for Unsupervised Cross-Lingual Adaptation
- DISK-CSV: Distilling Interpretable Semantic Knowledge with a Class Semantic Vector
- Attention Can Reflect Syntactic Structure (If You Let It)
- Are Neural Networks Extracting Linguistic Properties or Memorizing Training Data? An Observation with a Multilingual Probe for Predicting Tense
- Interpretability for Morphological Inflection: from Character-level Predictions to Subword-level Rules
- From characters to words: the turning point of BPE merges
- WER-BERT: Automatic WER Estimation with BERT in a Balanced Ordinal Classification Paradigm
Machine Learning, Green and Sustainable NLP, Language Resources and Evaluation
- Unsupervised Sentence-embeddings by Manifold Approximation and Projection
- BERxiT: Early Exiting for BERT with Better Fine-Tuning and Extension to Regression
- Scalable Evaluation and Improvement of Document Set Expansion via Neural Positive-Unlabeled Learning
- How Fast can BERT Learn Simple Natural Language Inference?
- Generative Text Modeling through Short Run Inference
- Randomized Deep Structured Prediction for Discourse-Level Processing
- Memorization vs. Generalization: Quantifying Data Leakage in NLP Performance Evaluation
- BART-TL: Weakly-Supervised Topic Label Generation
- Few-shot Learning for Slot Tagging with Attentive Relational Network
- Active Learning for Sequence Tagging with Deep Pre-trained Models and Bayesian Uncertainty Estimates
- Joint Energy-based Model Training for Better Calibrated Natural Language Understanding Models
- Through the Looking Glass: Learning to Attribute Synthetic Text Generated by Language Models
- Acquiring a Formality-Informed Lexical Resource for Style Analysis
- ADePT: Auto-encoder based Differentially Private Text Transformation
- Conceptual Grounding Constraints for Truly Robust Biomedical Name Representations
- Annealing Knowledge Distillation
- How Good (really) are Grammatical Error Correction Systems?
- Extremely Small BERT Models from Mixed-Vocabulary Training
- Diverse Adversaries for Mitigating Bias in Training
- Joint Learning of Hyperbolic Label Embeddings for Hierarchical Multi-label Classification
- Text Augmentation in a Multi-Task View
- Benchmarking a transformer-FREE model for ad-hoc retrieval
Demo Sessions
- A New Surprise Measure for Extracting Interesting Relationships between Persons
- AnswerQuest: A System for Generating Question-Answer Items from Multi-Paragraph Documents
- ASAD: Arabic Social media Analytics and unDerstanding
- CovRelex: A COVID-19 Retrieval System with Relation Extraction
- Conversational Agent for Daily Living Assessment Coaching Demo
- DebIE: A Platform for Implicit and Explicit Debiasing of Word Embedding Spaces
- Forum 4.0: An Open-Source User Comment Analysis Framework
- Graph Matching and Graph Rewriting: GREW tools for corpus exploration, maintenance and conversion
- ELITR Multilingual Live Subtitling: Demo and Strategy
- FrameForm: An Open-source Annotation Interface for FrameNet
- GCM: A Toolkit for Generating Synthetic Code-mixed Text
- PunKtuator: A Multilingual Punctuation Restoration System for Spoken and Written Text
- Representing ELMo embeddings as two-dimensional text online
- SLTEV: Comprehensive Evaluation of Spoken Language Translation
- SCoT: Sense Clustering over Time: a tool for the analysis of lexical change
- T2NER: Transformers based Transfer Learning Framework for Named Entity Recognition
- Trankit: A Light-Weight Transformer-based Toolkit for Multilingual Natural Language Processing
- Using and comparing Rhetorical Structure Theory parsers with rst-workbench
- Which is Better for Deep Learning: Python or MATLAB? Answering Comparative Questions in Natural Language
Student Research Workshop
- Computationally Efficient Wasserstein Loss for Structured Labels
- Have Attention Heads in BERT Learned Constituency Grammar?
- Do we read what we hear? Modeling orthographic influences on spoken word recognition
- PENELOPIE: Enabling Open Information Extraction for the Greek Language through Machine Translation
- A Computational Analysis of Vagueness in Revisions of Instructional Texts
- A reproduction of Apple's bi-directional LSTM models for language identification in short strings
- Automatically Cataloging Scholarly Articles using Library of Congress Subject Headings
- Model Agnostic Answer Reranking System for Adversarial Question Answering
- BERT meets Cranfield: Uncovering the Properties of Full Ranking on Fully Labeled Data
- Siamese Neural Networks for Detecting Complementary Products
- Contrasting distinct structured views to learn sentence embeddings
- Discrete Reasoning Templates for Natural Language Understanding
- Multilingual Email Zoning
- Familiar words but strange voices: Modelling the influence of speech variability on word recognition
- Emoji-Based Transfer Learning for Sentiment Tasks
- A Little Pretraining Goes a Long Way: A Case Study on Dependency Parsing Task for Low-resource Morphologically Rich Languages
- Development of Conversational AI for Sleep Coaching Programme
- Relating Relations: Meta-Relation Extraction from Online Health Forum Posts
- Towards Personalised and Document-level Machine Translation of Dialogue
- Semantic-aware transformation of short texts using word embeddings: An application in the Food Computing domain
- TMR: Evaluating NER Recall on Tough Mentions
- The Effectiveness of Morphology-aware Segmentation in Low-Resource Neural Machine Translation
- Making Use of Latent Space in Language GANs for Generating Diverse Text without Pre-training
- Beyond the English Web: Zero-Shot Cross-Lingual and Lightweight Monolingual Classification of Registers
- Explaining and Improving BERT Performance on Lexical Semantic Change Detection
- Why Find the Right One?
Zoom Q&A Session 6: Apr 23, (07:00-08:00 UTC)
Language Resources and Evaluation
- ParaSCI: A Large Scientific Paraphrase Dataset for Longer Paraphrase Generation
- NLQuAD: A Non-Factoid Long Question Answering Data Set
- Towards a Decomposable Metric for Explainable Evaluation of Text Generation from AMR
Machine Learning in NLP
- Keep Learning: Self-supervised Meta-learning for Learning from Inference
- Maximal Multiverse Learning for Promoting Cross-Task Generalization of Fine-Tuned Language Models
- AdapterFusion: Non-Destructive Task Composition for Transfer Learning
Phonology, Morphology and Word Segmentation
- Error Analysis and the Role of Morphology
- Facilitating Terminology Translation with Target Lemma Annotations
- Enhancing Sequence-to-Sequence Neural Lemmatization with External Resources
Sentiment Analysis, Stylistic Analysis and Argument Mining
- Exploiting Emojis for Abusive Language Detection
- Is "hot pizza" Positive or Negative? Mining Target-aware Sentiment Lexicons
- End-to-End Argument Mining as Biaffine Dependency Parsing
Zoom Q&A Session 7: Apr 23, (08:00-09:00 UTC)
Natural Language Generation
- Non-Autoregressive Text Generation with Pre-trained Language Models
- Incremental Beam Manipulation for Natural Language Generation
- Enconter: Entity Constrained Progressive Sequence Generation via Insertion-based Transformer
Zoom Q&A Session 8: Apr 23, (12:00-13:00 UTC)
Discourse and Summarization
- AREDSUM: Adaptive Redundancy-Aware Iterative Sentence Ranking for Extractive Document Summarization
- Discourse-Aware Unsupervised Summarization for Long Scientific Documents
- How to Evaluate a Summarizer: Study Design and Statistical Analysis for Manual Linguistic Quality Evaluation
Information Extraction
- Multi-facet Universal Schema
- Identify, Align, and Integrate: Matching Knowledge Graphs to Commonsense Reasoning Tasks
- Do Syntax Trees Help Pre-trained Transformers Extract Information?
Interpretability and Analysis of NLP Models
- Evaluating Neural Model Robustness for Machine Comprehension
- Coloring the Black Box: What Synesthesia Tells Us about Character Embeddings
- Probing the Probing Paradigm: Does Probing Accuracy Entail Task Relevance?
Gather Session 3: Apr 23, (13:00-15:00 UTC)
Information Extraction, Information Retrieval, Text Categorization and Question Answering
- Query Generation for Multimodal Documents
- Combining Deep Generative Models and Multi-lingual Pretraining for Semi-supervised Document Classification
- Multi-facet Universal Schema
- A Neural Few-Shot Text Classification Reality Check
- Zero-shot Neural Passage Retrieval via Domain-targeted Synthetic Question Generation
- Detecting Extraneous Content in Podcasts
- ChEMU-Ref: A Corpus for Modeling Anaphora Resolution in the Chemical Domain
- Benchmarking Machine Reading Comprehension: A Psychological Perspective
- BERT Prescriptions to Avoid Unwanted Headaches: A Comparison of Transformer Architectures for Adverse Drug Event Detection
- Multilingual Entity and Relation Extraction Dataset and Model
- Identify, Align, and Integrate: Matching Knowledge Graphs to Commonsense Reasoning Tasks
- DOCENT: Learning Self-Supervised Entity Representations from Large Document Collections
- Do Syntax Trees Help Pre-trained Transformers Extract Information?
- Fine-Grained Event Trigger Detection
- On-Device Text Representations Robust To Misspellings via Projections
- Adapting Event Extractors to Medical Data: Bridging the Covariate Shift
- Metric-Type Identification for Multi-Level Header Numerical Tables in Scientific Papers
- Graph-based Fake News Detection using a Summarization Technique
- A Simple Three-Step Approach for the Automatic Detection of Exaggerated Statements in Health Science News
- Complex Question Answering on knowledge graphs using machine translation and multi-task learning
- "Killing Me" Is Not a Spoiler: Spoiler Detection Model using Graph Neural Networks with Dependency Relation-Aware Attention Mechanism
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition
- Two Training Strategies for Improving Relation Extraction over Universal Graph
Dialogue and Interactive Systems, Natural Language Generation and Summarization
- The Gutenberg Dialogue Dataset
- Non-Autoregressive Text Generation with Pre-trained Language Models
- Evaluating the Evaluation of Diversity in Natural Language Generation
- Discourse Understanding and Factual Consistency in Abstractive Summarization
- MONAH: Multi-Modal Narratives for Humans to analyze conversations
- Zero-shot Generalization in Dialog State Tracking through Generative Question Answering
- MIDAS: A Dialog Act Annotation Scheme for Open Domain HumanMachine Spoken Conversations
- Alternating Recurrent Dialog Model with Large-scale Pre-trained Language Models
- Quantifying Appropriateness of Summarization Data for Curriculum Learning
- Self-Supervised and Controlled Multi-Document Opinion Summarization
- Few Shot Dialogue State Tracking using Meta-learning
- Globalizing BERT-based Transformer Architectures for Long Document Summarization
- Unsupervised Extractive Summarization using Pointwise Mutual Information
- Incremental Beam Manipulation for Natural Language Generation
- StructSum: Summarization via Structured Representations
- Unsupervised Abstractive Summarization of Bengali Text Documents
- Informative and Controllable Opinion Summarization
- Entity-level Factual Consistency of Abstractive Text Summarization
- Modelling Context Emotions using Multi-task Learning for Emotion Controlled Dialog Generation
- Extractive Summarization Considering Discourse and Coreference Relations based on Heterogeneous Graph
- Summarising Historical Text in Modern Languages
- Enconter: Entity Constrained Progressive Sequence Generation via Insertion-based Transformer
Linguistic Theories, Cognitive Modeling and Psycholinguistics
- Disambiguatory Signals are Stronger in Word-initial Positions
- Telling BERT's Full Story: from Local Attention to Global Aggregation
- Frequency-Guided Word Substitutions for Detecting Textual Adversarial Examples
- Syntactic Nuclei in Dependency Parsing – A Multilingual Exploration
- Searching for Search Errors in Neural Morphological Inflection
- Dependency parsing with structure preserving embeddings
- Error Analysis and the Role of Morphology
- Applying the Transformer to Character-level Transduction
- Paraphrases do not explain word analogies
- First Align, then Predict: Understanding the Cross-Lingual Ability of Multilingual BERT
- Evaluating Neural Model Robustness for Machine Comprehension
- Deep Subjecthood: Higher-Order Grammatical Features in Multilingual BERT
- Coloring the Black Box: What Synesthesia Tells Us about Character Embeddings
- Syntax-BERT: Improving Pre-trained Transformers with Syntax Trees
- Enhancing Sequence-to-Sequence Neural Lemmatization with External Resources
- Cognition-aware Cognate Detection
- Reanalyzing the Most Probable Sentence Problem: A Case Study in Explicating the Role of Entropy in Algorithmic Complexity
- Probing the Probing Paradigm: Does Probing Accuracy Entail Task Relevance?
- Probing for idiomaticity in vector space models
- BERTese: Learning to Speak to BERT
- Supertagging the Long Tail with Tree-Structured Decoding of Complex Categories
- Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement
Machine Learning, Language Resources and Evaluation, Miscellaneous NLP
- Keep Learning: Self-supervised Meta-learning for Learning from Inference
- Maximal Multiverse Learning for Promoting Cross-Task Generalization of Fine-Tuned Language Models
- AREDSUM: Adaptive Redundancy-Aware Iterative Sentence Ranking for Extractive Document Summarization
- Exploiting Emojis for Abusive Language Detection
- A Systematic Review of Reproducibility Research in Natural Language Processing
- ParaSCI: A Large Scientific Paraphrase Dataset for Longer Paraphrase Generation
- AdapterFusion: Non-Destructive Task Composition for Transfer Learning
- Is "hot pizza" Positive or Negative? Mining Target-aware Sentiment Lexicons
- End-to-End Argument Mining as Biaffine Dependency Parsing
- Discourse-Aware Unsupervised Summarization for Long Scientific Documents
- Joint Learning of Representations for Web-tables, Entities and Types using Graph Convolutional Network
- NLQuAD: A Non-Factoid Long Question Answering Data Set
- Towards a Decomposable Metric for Explainable Evaluation of Text Generation from AMR
- We Need To Talk About Random Splits
- How to Evaluate a Summarizer: Study Design and Statistical Analysis for Manual Linguistic Quality Evaluation
- Building Representative Corpora from Illiterate Communities: A Review of Challenges and Mitigation Strategies for Developing Countries
- Process-Level Representation of Scientific Protocols with Interactive Annotation
- A Study of Automatic Metrics for the Evaluation of Natural Language Explanations
- Adaptive Fusion Techniques for Multimodal Data
- Detecting Scenes in Fiction: A new Segmentation Task
- Disfluency Correction using Unsupervised and Semi-supervised Learning
- Towards More Fine-grained and Reliable NLP Performance Prediction
Machine Translation and Multilinguality
- Multi-split Reversible Transformers Can Enhance Neural Machine Translation
- Bootstrapping Multilingual AMR with Contextual Word Alignments
- Does Typological Blinding Impede Cross-Lingual Sharing?
- Quality Estimation without Human-labeled Data
- El Volumen Louder Por Favor: Code-switching in Task-oriented Semantic Parsing
- Enriching Non-Autoregressive Transformer with Syntactic and Semantic Structures for Neural Machine Translation
- WikiMatrix: Mining 135M Parallel Sentences in 1620 Language Pairs from Wikipedia
- A phonetic model of non-native spoken word processing
- The Source-Target Domain Mismatch Problem in Machine Translation
- Multilingual Neural Machine Translation with Deep Encoder and Multiple Shallow Decoders
- Multilingual and cross-lingual document classification: A meta-learning approach
- Lexical Normalization for Code-switched Data and its Effect on POS Tagging
- Cross-Cultural Similarity Features for Cross-Lingual Transfer Learning of Pragmatically Motivated Tasks
- Learning Coupled Policies for Simultaneous Machine Translation using Imitation Learning
- Better Neural Machine Translation by Extracting Linguistic Information from BERT
- CDA: a Cost Efficient Content-based Multilingual Web Document Aligner
- Facilitating Terminology Translation with Target Lemma Annotations
- Multilingual LAMA: Investigating Knowledge in Multilingual Pretrained Language Models
- Meta-Learning for Effective Multi-task and Multilingual Modelling
- Exploring Supervised and Unsupervised Rewards in Machine Translation
- Revisiting Multi-Domain Machine Translation
- Deciphering Undersegmented Ancient Scripts Using Phonetic Prior
Lexical Semantics, Sentence-Level Semantics, and Natural Language Grounding
- Effects of Pre- and Post-Processing on type-based Embeddings in Lexical Semantic Change Detection
- Exploiting Cloze-Questions for Few-Shot Text Classification and Natural Language Inference
- PolyLM: Learning about Polysemy through Language Modeling
- The Role of Syntactic Planning in Compositional Image Captioning
- Cross-lingual Entity Alignment with Incidental Supervision
- Multiple Tasks Integration: Tagging, Syntactic and Semantic Parsing as a Single Task
- Exploring Transitivity in Neural NLI Models through Veridicality
- SANDI: Story-and-Images Alignment
- Data Augmentation for Hypernymy Detection
- ECOL-R: Encouraging Copying in Novel Object Captioning with Reinforcement Learning
- Few-Shot Semantic Parsing for New Predicates
- Evaluating language models for the retrieval and categorization of lexical collocations
- Semantic Parsing of Disfluent Speech
- An Empirical Study on the Generalization Power of Neural Representations Learned via Visual Guessing Games
- Project-then-Transfer: Effective Two-stage Cross-lingual Transfer for Semantic Dependency Parsing
- Crisscrossed Captions: Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO
- Co-evolution of language and agents in referential games
- Is Supervised Syntactic Parsing Beneficial for Language Understanding Tasks? An Empirical Investigation
- Unsupervised Word Polysemy Quantification with Multiresolution Grids of Contextual Embeddings
- Removing Word-Level Spurious Alignment between Images and Pseudo-Captions in Unsupervised Image Captioning
- Comparing Knowledge-Intensive and Data-Intensive Models for English Resource Semantic Parsing