Second, we demonstrate how to insert undetectable backdoors in models trained using the Random Fourier Features (RFF) learning paradigm or in Random ReLU networks.
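As context for the learning paradigm named above, the following is a minimal sketch of Random Fourier Features in the style of Rahimi and Recht: a frozen random Gaussian projection with a cosine nonlinearity, on top of which only a linear predictor is trained. It illustrates the RFF setting only, not the backdoor construction; the data, dimensions, and bandwidth are placeholder assumptions.

```python
# Minimal sketch of the Random Fourier Features (RFF) learning paradigm:
# frozen random projections + cosine features, then a linear readout.
# Illustrative only; sizes and data are placeholders.
import numpy as np

rng = np.random.default_rng(0)

def rff_features(X, W, b):
    """Map inputs to D random Fourier features approximating an RBF kernel."""
    D = W.shape[1]
    return np.sqrt(2.0 / D) * np.cos(X @ W + b)

# Toy data: n points in d dimensions with a random linear teacher.
n, d, D = 200, 10, 512
X = rng.normal(size=(n, d))
y = np.sign(X @ rng.normal(size=d))

# Draw the random (frozen) feature map once, then fit only the linear layer.
gamma = 1.0 / d                                   # assumed RBF bandwidth
W = rng.normal(scale=np.sqrt(2 * gamma), size=(d, D))
b = rng.uniform(0, 2 * np.pi, size=D)
Phi = rff_features(X, W, b)

# Ridge-regression readout (closed form) on the random features.
lam = 1e-3
w = np.linalg.solve(Phi.T @ Phi + lam * np.eye(D), Phi.T @ y)
print("train accuracy:", np.mean(np.sign(Phi @ w) == y))
```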
We investigate the optimal model size and number of tokens for training a transformer language model under a given compute budget.
Ranked #1 on Multi-task Language Understanding on MMLU
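To make the compute-budget question above concrete, the sketch below uses the commonly cited approximation that training cost is roughly C ≈ 6·N·D FLOPs for N parameters and D tokens, together with the rule of thumb that parameters and tokens should be scaled in roughly equal proportion (often summarized as on the order of 20 tokens per parameter). These constants are back-of-the-envelope approximations, not the paper's fitted scaling laws.

```python
# Back-of-the-envelope compute-optimal sizing, assuming C ~= 6 * N * D
# training FLOPs and a ~20 tokens-per-parameter rule of thumb.
# The constants are illustrative approximations, not fitted laws.
def compute_optimal(C_flops, tokens_per_param=20.0):
    """Return (parameters N, tokens D) that spend C_flops with D = r * N."""
    # C = 6 * N * D = 6 * r * N^2  =>  N = sqrt(C / (6 * r))
    N = (C_flops / (6.0 * tokens_per_param)) ** 0.5
    D = tokens_per_param * N
    return N, D

for budget in (1e21, 1e23, 1e25):
    N, D = compute_optimal(budget)
    print(f"C={budget:.0e} FLOPs -> ~{N/1e9:.1f}B params, ~{D/1e9:.0f}B tokens")
```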
In this work, we show that this model diversity is symbiotic, and can be leveraged to build AI systems with structured Socratic dialogue -- in which new multimodal tasks are formulated as a guided language-based exchange between different pre-existing foundation models, without additional finetuning.
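One way to picture such a structured Socratic dialogue is a short, scripted exchange in which one frozen model describes an input in language and another reasons over that description. The sketch below is a hypothetical zero-shot pipeline under that reading; `caption_image` and `complete_text` are placeholders for whatever pre-trained vision-language and language models are plugged in, not APIs from the paper.

```python
# Hypothetical sketch of a Socratic-dialogue pipeline: frozen, pre-existing
# models exchange information purely through text, with no extra finetuning.
# `caption_image` and `complete_text` are placeholders, not real APIs.

def caption_image(image) -> str:
    """Placeholder: a frozen vision-language model returning a description."""
    raise NotImplementedError("plug in a pre-trained vision-language model")

def complete_text(prompt: str) -> str:
    """Placeholder: a frozen language model answering a text prompt."""
    raise NotImplementedError("plug in a pre-trained language model")

def socratic_vqa(image, question: str) -> str:
    # Step 1: the vision-language model "speaks" about the image in text.
    description = caption_image(image)
    # Step 2: the language model reasons over that language-based summary,
    # so the new multimodal task needs no additional training.
    prompt = (
        f"Image description: {description}\n"
        f"Question: {question}\n"
        f"Answer:"
    )
    return complete_text(prompt)
```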
We show that STaR significantly improves performance on multiple datasets compared to a model fine-tuned to directly predict final answers, and performs comparably to fine-tuning a 30x larger state-of-the-art language model on CommonsenseQA.
Ranked #9 on Common Sense Reasoning on CommonsenseQA
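For context, STaR's gain over direct answer prediction comes from a bootstrapping loop: the model generates a rationale for each question, the rationale is kept only if it leads to the correct answer, and the model is fine-tuned on the kept examples before the loop repeats. The sketch below is only a schematic of that loop; the helper functions and data format are placeholders, not the paper's code.

```python
# Schematic of a STaR-style bootstrapping loop; the helpers are placeholders
# for a language model's sampling and fine-tuning routines.

def generate_rationale_and_answer(model, question):
    """Placeholder: few-shot prompt the model for a rationale and an answer."""
    raise NotImplementedError

def finetune(base_model, examples):
    """Placeholder: fine-tune on (question, rationale, answer) triples."""
    raise NotImplementedError

def star_bootstrap(base_model, dataset, num_iterations=3):
    model = base_model
    for _ in range(num_iterations):
        kept = []
        for question, gold_answer in dataset:
            rationale, answer = generate_rationale_and_answer(model, question)
            # Keep a self-generated rationale only if it led to the right answer.
            if answer == gold_answer:
                kept.append((question, rationale, gold_answer))
        # Restart fine-tuning from the base model on the accumulated examples.
        model = finetune(base_model, kept)
    return model
```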
Recent text-to-image generation methods provide a simple yet exciting conversion capability between text and image domains.
Generating temporally coherent, high-fidelity video is an important milestone in generative modeling research.
We present a framework that gives deep insight into the link between the physical and technical-tactical aspects of soccer, and that uses a top-down approach to associate physical performance with value generation.
With the rapid development of deep learning, training Big Models (BMs) for multiple downstream tasks has become a popular paradigm.
The optimal amount of data augmentation (DA) or weight decay found by cross-validation leads to disastrous model performance on some classes: e.g., on ImageNet with a ResNet-50, the "barn spider" classification test accuracy falls from
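The class-dependent effect described above only shows up when test accuracy is broken down per class rather than reported in aggregate. Below is a minimal sketch of that breakdown; the labels and predictions are toy placeholders standing in for a trained model's outputs on a real test set.

```python
# Minimal sketch of a per-class accuracy breakdown, the kind of measurement
# that exposes class-dependent effects of data augmentation or weight decay.
# `y_true` and `y_pred` are toy placeholders for real test labels/predictions.
import numpy as np

def per_class_accuracy(y_true, y_pred, num_classes):
    """Return an array where entry c is the test accuracy on class c."""
    accs = np.full(num_classes, np.nan)
    for c in range(num_classes):
        mask = (y_true == c)
        if mask.any():
            accs[c] = np.mean(y_pred[mask] == c)
    return accs

# Toy example with 3 classes; in practice these come from a trained model.
y_true = np.array([0, 0, 1, 1, 2, 2, 2])
y_pred = np.array([0, 1, 1, 1, 2, 0, 2])
print(per_class_accuracy(y_true, y_pred, num_classes=3))  # [0.5, 1.0, 0.667]
```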
This design enables the original ViT architecture to be fine-tuned for object detection without needing to redesign a hierarchical backbone for pre-training.
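A key ingredient of that design is a simple feature pyramid built from the plain ViT's single-scale (stride-16) output, using a few upsampling and downsampling heads instead of a hierarchical backbone. Below is a hedged PyTorch-style sketch of such a pyramid; the channel widths and exact head composition are assumptions for illustration, not the paper's reference implementation.

```python
# Hedged sketch of a "simple feature pyramid" built from a plain ViT's
# single-scale feature map (stride 16): upsample/downsample heads produce
# strides {4, 8, 16, 32} without a hierarchical backbone.
# Channel widths and head details are assumptions.
import torch
import torch.nn as nn

class SimpleFeaturePyramid(nn.Module):
    def __init__(self, in_channels=768, out_channels=256):
        super().__init__()
        self.to_stride4 = nn.Sequential(              # 16 -> 4: upsample x4
            nn.ConvTranspose2d(in_channels, in_channels // 2, 2, stride=2),
            nn.GELU(),
            nn.ConvTranspose2d(in_channels // 2, out_channels, 2, stride=2),
        )
        self.to_stride8 = nn.ConvTranspose2d(         # 16 -> 8: upsample x2
            in_channels, out_channels, 2, stride=2)
        self.to_stride16 = nn.Conv2d(in_channels, out_channels, 1)  # same scale
        self.to_stride32 = nn.Sequential(             # 16 -> 32: downsample x2
            nn.MaxPool2d(2), nn.Conv2d(in_channels, out_channels, 1))

    def forward(self, x):
        # x: last ViT feature map at stride 16, shape (B, C, H/16, W/16)
        return [self.to_stride4(x), self.to_stride8(x),
                self.to_stride16(x), self.to_stride32(x)]

# Example: a 768-dim ViT-B feature map for a 224x224 image (14x14 tokens).
feats = SimpleFeaturePyramid()(torch.randn(1, 768, 14, 14))
print([f.shape for f in feats])
```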