Research Tools: code, datasets, & models

Tool

AMDIM – Augmented Multiscale Deep InfoMax

AMDIM (Augmented Multiscale Deep InfoMax) is an approach to self-supervised representation learning based on maximizing mutual information between features extracted from multiple views of a shared context.

GitHub Publication

Tool

MazeExplorer [1.0.0]

MazeExplorer is a customisable 3D benchmark for assessing generalisation in Reinforcement Learning. It is based on the 3D first-person game Doom and the open-source environment VizDoom. This repository contains the code for the MazeExplorer Gym…

GitHub Publication

Tool

Dead-ends and Secure Exploration in Reinforcement Learning [1.0]

This repository hosts the code for the following ICML 2019 paper: Dead-ends and Secure Exploration in Reinforcement Learning

GitHub Publication

Tool

Decentralized & Collaborative AI on Blockchain [1.0]

We propose a framework for participants to collaboratively build a dataset and use smart contracts to host a continuously updated model.

GitHub Publication

Tool

SeeDot

SeeDot is an automatic quantization tool that generates efficient machine learning (ML) inference code for IoT devices. ML models are usually expressed in floating-point, and IoT devices typically lack hardware support for floating-point arithmetic. Hence,…

GitHub Publication

Tool

Snapshot Serengeti

This data set contains 1.2M sequences of camera trap images, totaling 3.2M images. Species-level labels are provided for 48 species. We have also added approximately 100,000 bounding box annotations to approximately 38,000 images. The images…

Access

Tool

North American Camera Trap Images

This data set contains 3.7M camera trap images from five locations across the United States, with species-level labels for 28 species. More information about this data set is available in the associated manuscript: Tabak, M.…

Access

Tool

Microsoft Speech Corpus (Indian languages)

Microsoft Speech Corpus (Indian languages) release contains conversational and phrasal speech training and test data for Telugu, Tamil and Gujarati languages. The data package includes audio and corresponding transcripts. Data provided in this dataset shall…

Access

Tool

Caltech Camera Traps

This data set contains 244,497 images from 140 camera locations in the Southwestern United States, with species-level labels for 22 species, and approximately 66,000 bounding box annotations.

GitHub

Tool

Tensor Watch Tool for Deep Learning [0.8.0]

TensorWatch is a comprehensive library of tools to debug and monitor training phase for Deep Learning and Reinforcement Learning models as well as perform analysis on trained models. TensorWatch is a debugging and visualization tool…

GitHub