A Neural Method for Goal-Oriented Dialog Systems to interact with Named EntitiesMany goal-oriented dialog tasks, especially ones in which the dialog system has to interact with external knowledge sources such as databases, have to handle a large number of Named Entities (NEs). There are at least two challenges in handling NEs using neural methods in such settings: individual NEs may occur only rarely making it hard to learn good representations of them, and many of the Out Of Vocabulary words that occur during test time may be NEs. Thus, the need to interact well with these NEs has emerged as a serious challenge to building neural methods for goal-oriented dialog tasks. In this work, we propose a new neural method for this problem and present empirical evaluations on a structured Question Answering task and goal-oriented dialog tasks and show that our proposed method can be effective in interacting with NEs in these settings.
Translation of Complex Natural Language Queries into SQLIn natural language, the same idea can be expressed in many different ways: not only using different words but also using entirely different sentence structures. In databases too we can have radically different schema designs to represent the same information. Even against a specific schema, we can have multiple equivalent query expressions. This great variety makes it difficult to learn translations, and makes it unlikely that any rule-based translation that works for one natural language statement will work for another. To address these challenges, we propose to divide and conquer the problem. We use deep-learning to address variations in natural language vocabulary; we use past history to determine desired query structure; we use database statistics for entity disambiguation.
World Knowledge for Semantic Parsing with Abstract Meaning RepresentationIn our experiments, we introduce modifications to a parser for Abstract Meaning Representation that allow it to more accurately identify concepts and named entities in semantic sentence representations. We look at the types of errors that currently exist in a state of the art parser and explore the problem of how to integrate world knowledge to reduce these errors. An examination of the limitations of these types of features is included, which provides insight into the potential for world knowledge to benefit future work in AMR parsing.
Crowd Annotation of Expert Content
AI for disentanglement and conversation graph extractionDialogue often consists of multiple threads of conversation mixed together, with a complex structure in which an utterance could be responding to multiple previous utterances of which some might be far back in the conversation, and there could be multiple utterances that respond to a single one. One key step in learning to understand conversation is disentangling these threads. We intend to develop neural network based models that can extract the graph representing the conversation, along with a hand-labeled dataset for evaluation. We will focus on the Ubuntu corpus, but also consider smaller IRC datasets used in prior work.
Nikita Bhutani, Kun Qian, Yunyao Li, H. V. Jagadish, Mauricio A. Hernández, Mitesh Vasa. Exploiting Structure in Representation of Named Entities using Active Learning. In CoLing 2018.
Kun Qian, Nikita Bhutani, Yunyao Li, H. V. Jagadish, Mauricio A. Hernández: LUSTRE: An Interactive System for Entity Structured Representation and Variant Generation. In ICDE 2018.
Chris Baik, H. V. Jagadish, Yunyao Li. Bridging the Semantic Gap with SQL Query Logs in Natural Language Interfaces to Databases. In ICDE 2019.
Catherine Finegan-Dollak*, Jonathan K. Kummerfeld*, Li Zhang, Karthik Ramanathan, Sesh Sadasivam, Rui Zhang, and Dragomir Radev. Improving Text-to-SQL Evaluation Methodology. In ACL 2018. [Data and Code]
Koichiro Yoshino, Chiori Hori, Julien Perez, Luis Fernando D'Haro, Lazaros Polymenakos, Chulaka Gunasekara, Walter S. Lasecki, Jonathan K. Kummerfeld, Michel Galley, Chris Brockett, Jianfeng Gao, Bill Dolan, Xiang Gao, Huda Alamari, Tim K. Marks, Devi Parikh and Dhruv Batra. Dialog System Technology Challenge 7. In ConvAI at NeurIPS 2018.
Charles Welch, Jonathan K. Kummerfeld, Song Feng, and Rada Mihalcea. World Knowledge for Abstract Meaning Representation Parsing. In LREC 2018.
Youxuan Jiang, Catherine Finegan-Dollak, Jonathan K. Kummerfeld, and Walter Lasecki. Effective Crowdsourcing for a New Type of Summarization Task. In NAACL 2018.
Rui Zhang, Honglak Lee, Lazaros Polymenakos, and Dragomir Radev. Addressee and Response Selection in Multi-Party Conversations with Speaker Interaction RNNs. In Proceedings of the Association for the Advancement of AI (AAAI 2018). New Orleans, LA.
Lajanugen Logeswaran, Honglak Lee, and Dragomir Radev. Sentence Ordering using Recurrent Neural Networks. In Proceedings of the Association for the Advancement of AI (AAAI 2018). New Orleans, LA.
Joseph J Williams, Anna N. Rafferty, Dustin Tingley, Andrew Ang, Walter S. Lasecki, and Juho Kim. Enhancing Online Problems Through Instructor-Centered Tools for Randomized Experiments. In Proceedings of the ACM Conference on Human Factors in Computing Systems (CHI 2018). Montreal, Canada.
Harmanpreet Kaur, Mitchell Gordon, Yiwei Yang, Jeffrey P. Bigham, Jaime Teevan, Ece Kamar, and Walter S. Lasecki. CrowdMask: Using Crowds to Preserve Privacy in Crowd-Powered Systems via Progressive Filtering. In Proceedings of the AAAI Conference on Human Computation (HCOMP 2017). Quebec City, Canada.
Youxuan (Lucy) Jiang, Jonathan K. Kummerfeld, and Walter S. Lasecki. Understanding Task Design Trade-offs in Crowdsourced Paraphrase Collection. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL 2017). Vancouver, Canada.
Xiaoxiao Guo, Tim Klinger, Clemens Rosenbaum, Joseph P. Bigus, Murray Campbell, Ban Kawas, Kartik Talamadupula, Gerald Tesauro, and Satinder Singh. Learning to Query, Reason, and Answer Questions On Ambiguous Texts. In Proceedings of the 6th International Conference on Learning Representations (ICLR 2017). Toulon, France.
Yelin Kim and Emily Mower Provost. ISLA: Temporal Segmentation and Labeling for Audio-Visual Emotion Recognition. IEEE Transactions on Affective Computing, vol: To appear, 2017.
Soheil Khorram, Zakariah Aldeneh, Dimitrios Dimitriadis, Melvin McInnis, and Emily Mower Provost. Capturing Long-term Temporal Dependencies with Convolutional Networks for Continuous Emotion. Interspeech. Stockholm, Sweden, August 2017.
John Gideon, Soheil Khorram, Zakariah Aldeneh, Dimitrios Dimitriadis, and Emily Mower Provost. Progressive Neural Networks for Transfer Learning in Emotion Recognition. Interspeech. Stockholm, Sweden, August 2017.
Duc Le, Zakariah Aldeneh, and Emily Mower Provost. Discretized Continuous Speech Emotion Recognition with Multi-Task Deep Recurrent Neural Network. Interspeech. Stockholm, Sweden, August 2017.
Quan Chen, Hailong Yang, Minyi Guo, Ram Srivatsa Kannan, Jason Mars, and Lingjia Tang. Prophet: Precise QoS Prediction on Non-Preemptive Accelerators to Improve Utilization in Warehouse-Scale Computers. In Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems, pp. 17-32. ACM, 2017.
Hailong Yang, Quan Chen, Moeiz Riaz, Zhongzhi Luan, Lingjia Tang, and Jason Mars. PowerChief: Intelligent Power Allocation for Multi-Stage Applications to Improve Responsiveness on Power Constrained CMP. In Proceedings of the 44th Annual International Symposium on Computer Architecture, pp. 133-146. ACM, 2017.
Quan Chen, Hailong Yang, Jason Mars, and Lingjia Tang. Baymax: Qos awareness and increased utilization for non-preemptive accelerators in warehouse scale computers. In ACM SIGPLAN Notices, vol. 51, no. 4, pp. 681-696. ACM, 2016.
Charles Welch and Rada Mihalcea. Targeted Sentiment to Understand Student Comments. In Proceedings of the 26th International Conference on Computational Linguistics (COLING 2016). Osaka, Japan.