Whispering Experts: Toxicity Mitigation in Pre-trained Language Models by Dampening Expert Neurons
An important issue with Large Language Models (LLMs) is their undesired ability to generate toxic language. In this work, we show that the...
International Conference on Machine Learning (ICML) 2024
International Conference on Machine Learning (ICML) 2024
Source link
PINE: Efficient Norm-Bound Verification for Secret-Shared Vectors
Secure aggregation of high-dimensional vectors is a fundamental primitive in federated statistics and learning. A two-server system such as PRIO allows for scalable...
Projected Language Models: A Large Model Pre-Segmented Into Smaller Ones
This paper has been accepted at the Foundation Models in the Wild workshop at ICML 2024.
Large language models are versatile tools but are...
Improving GFlowNets for Text-to-Image Diffusion Alignment
This paper was accepted at the Foundation Models in the Wild workshop at ICML 2024.
Diffusion models have become the de-facto approach for generating...
Towards Automated Accessibility Report Generation for Mobile Apps
Many apps have basic accessibility issues, like missing labels or low contrast. Automated tools can help app developers catch basic issues, but can...
On a Neural Implementation of Brenier’s Polar Factorization
In 1991, Brenier proved a theorem that generalizes the polar decomposition for square matrices -- factored as PSD ×times× unitary -- to any...
Contrasting Multiple Representations with the Multi-Marginal Matching Gap
Learning meaningful representations of complex objects that can be seen through multiple (k≥3kgeq 3k≥3) views or modalities is a core task in machine...
A Direct Algorithm for Multi-Gyroscope Infield Calibration
In this paper, we address the problem of estimating the rotational extrinsics, as well as the scale factors of two gyroscopes rigidly mounted...
CodeAct: Your LLM Agent Acts Better when Generating Code
Large Language Model (LLM) agents, capable of performing a broad range of actions, such as invoking tools and controlling robots, show great potential...