Apple Workshop on Natural Language Understanding 2024
Progress in natural language processing enables more intuitive ways of interacting with technology. For example, many of Apple’s products and services, including Siri...
SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators
Large Language Models (LLMs) have transformed natural language processing, but face significant challenges in widespread deployment due to their high runtime cost. In...
Interpreting and Improving Optimal Control Problems With Directional Corrections
Many robotics tasks, such as path planning or trajectory optimization, are formulated as optimal control problems (OCPs). The key to obtaining high performance...
Mutual Reinforcement of LLM Dialogue Synthesis and Summarization Capabilities for Few-Shot Dialogue Summarization
In this work, we propose Mutual Reinforcing Data Synthesis (MRDS) within LLMs to improve few-shot dialogue summarization task. Unlike prior methods that require...
Modeling Speech Emotion With Label Variance and Analyzing Performance Across Speakers and Unseen Acoustic...
Spontaneous speech emotion data usually contain perceptual grades where graders assign emotion score after listening to the speech files. Such perceptual grades introduce...
Universally Instance-Optimal Mechanisms for Private Statistical Estimation
We consider the problem of instance-optimal statistical estimation under the constraint of differential privacy where mechanisms must adapt to the difficulty of the...
The Role of Prosody in Spoken Question Answering
Spoken language understanding research to date has generally carried a heavy text perspective. Most datasets are derived from text, which is then subsequently...
Fundamental Challenges in Evaluating Text2SQL Solutions and Detecting Their Limitations
In this work, we dive into the fundamental challenges of evaluating Text2SQL solutions and highlight potential failure causes and the potential risks of...
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing
Text-to-Image (T2I) diffusion models have shown impressive results in generating visually compelling images following user prompts. Building on this, various methods further fine-tune...
Exploring Empty Spaces: Human-in-the-Loop Data Augmentation
Data augmentation is crucial to make machine learning models more robust and safe. However, augmenting data can be challenging as it requires generating...