Planting Undetectable Backdoors in Machine Learning Models

no code yet • 14 Apr 2022

Second, we demonstrate how to insert undetectable backdoors in models trained using the Random Fourier Features (RFF) learning paradigm or in Random ReLU networks.

Adversarial Robustness

Training Compute-Optimal Large Language Models

no code yet • 29 Mar 2022

We investigate the optimal model size and number of tokens for training a transformer language model under a given compute budget.

Language Modelling, Multi-task Language Understanding +1

Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language

google-research/google-research • 1 Apr 2022

In this work, we show that this model diversity is symbiotic, and can be leveraged to build AI systems with structured Socratic dialogue -- in which new multimodal tasks are formulated as a guided language-based exchange between different pre-existing foundation models, without additional finetuning.

Image Captioning

STaR: Bootstrapping Reasoning With Reasoning

no code yet • 28 Mar 2022

We show that STaR significantly improves performance on multiple datasets compared to a model fine-tuned to directly predict final answers, and performs comparably to fine-tuning a 30× larger state-of-the-art language model on CommonsenseQA.

Common Sense Reasoning, Language Modelling +1

Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors

CasualGANPapers/Make-A-Scene • 24 Mar 2022

Recent text-to-image generation methods provide a simple yet exciting conversion capability between text and image domains.

Text-to-Image Generation

Video Diffusion Models

no code yet • 7 Apr 2022

Generating temporally coherent, high-fidelity video is an important milestone in generative modeling research.

Video Generation

Is it worth the effort? Understanding and contextualizing physical metrics in soccer

no code yet • 5 Apr 2022

We present a framework that gives deep insight into the link between the physical and technical-tactical aspects of soccer, and that associates physical performance with value generation through a top-down approach.

A Roadmap for Big Model

no code yet • 26 Mar 2022

With the rapid development of deep learning, training Big Models (BMs) for multiple downstream tasks has become a popular paradigm.

Language Modelling, Machine Translation +1

The Effects of Regularization and Data Augmentation are Class Dependent

no code yet • 7 Apr 2022

The optimal amount of data augmentation (DA) or weight decay found by cross-validation leads to disastrous model performance on some classes; e.g., on ImageNet with a ResNet-50, the "barn spider" classification test accuracy falls from 68% to 46% solely from introducing random-crop DA during training.

Data Augmentation

Exploring Plain Vision Transformer Backbones for Object Detection

ViTAE-Transformer/ViTDet • 30 Mar 2022

This design enables the original ViT architecture to be fine-tuned for object detection without needing to redesign a hierarchical backbone for pre-training.

Object Detection
