Background/Theory

StatQuest with Josh Starmer (YouTube)

Tagline

DOUBLE BAM!!!

Link

https://www.youtube.com/channel/UCtYLUTtgS3k1Fg4y5tAhLbw

Statistics, Machine Learning and Data Science can sometimes seem like very scary topics, but since each technique is really just a combination of small and simple steps, they are actually quite simple. My goal with StatQuest is to break down the major methodologies into easy to understand pieces. That said, I don’t dumb down the material. Instead, I build up your understanding so that you are smarter.

Notable Playlists:

The Asimov Institute’s Neural Network Zoo

Link 1

https://www.asimovinstitute.org/neural-network-zoo-prequel-cells-layers/

Link 2

https://www.asimovinstitute.org/neural-network-zoo/

Provides cheat sheets depicting various types of neural network architectures and their components (included below). Also contains brief, entry-level descriptions of each type and links to relevant publications (not shown below; please visit the corresponding webpage to view the full contents).

Free Online Books, Handbooks, and Guides

Tim-R413’s Deep Learning Handbook

Tagline

A series of modules meant to assist in learning different areas of deep learning through interactive tutorials.

Repository

https://github.com/Tim-R413/Deep-Learning-Handbook

Start Machine Learning

Tagline

A complete guide to start and improve in machine learning (ML), artificial intelligence (AI) in 2023 without ANY background in the field and stay up-to-date with the latest news and state-of-the-art techniques!

Introductory video

https://youtu.be/RirEw-uaS_8

Repository

https://github.com/louisfb01/start-machine-learning

This guide is intended for anyone having zero or a small background in programming, maths, and machine learning. There is no specific order to follow, but a classic path would be from top to bottom. If you don’t like reading books, skip it, if you don’t want to follow an online course, you can skip it as well. There is not a single way to become a machine learning expert and with motivation, you can absolutely achieve it.

All resources listed here are free, except some online courses and books, which are certainly recommended for a better understanding, but it is definitely possible to become an expert without them, with a little more time spent on online readings, videos and practice. When it comes to paying courses, the links in this guide are affiliated links. Please, use them if you feel like following a course as it will support me. Thank you, and have fun learning! Remember, this is completely up to you and not necessary. I felt like it was useful to me and maybe useful to others as well.

Designing ML Systems summary

Repository

https://github.com/serodriguez68/designing-ml-systems-summary

A detailed summary of Designing Machine Learning Systems by Chip Huyen. This book gives you an end-to-end view of all the steps required to build AND OPERATE ML products in production. It is a must-read for ML practitioners and Software Engineers transitioning into ML.

Infographic and Diagram Collections

Collections of Visuals and Educational Graphics

https://github.com/dvgodoy/dl-visuals

The AI Summer’s Deep Learning Visuals

Repository

https://github.com/The-AI-Summer/deep-learning-visuals

DAIR.AI’s ML Visuals

Tagline

ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

Repository

https://github.com/dair-ai/ml-visuals

Terminology Guides

Machine Learning Glossary

Tagline

This is an online glossary of terms for artificial intelligence, machine learning, computer vision, natural language processing, and statistics.

Link

https://machinelearning.wtf

Repository

https://github.com/machine-learning-glossary/glossary

Other Collections

gmihaila’s ML Things

Repository

https://github.com/gmihaila/ml_things

This is where I put things I find useful that speed up my work with Machine Learning. Ever looked in your old projects to reuse those cool functions you created before? Well, this repo is designed to be a Python Library of functions I created in my previous project that can be reused. I also share some Notebooks Tutorials and Python Code Snippets.

	Table of contents
Installation	Details on how to install `ml_things`.
Array Functions	Details on the ml_things array related functions (`pad_array`, `batch_array`)
Plot Functions	Details on the ml_things plot related functions (`plot_array`, `plot_dict`, `plot_confusion_matrix`)
Text Functions	Details on the ml_things text related functions (`clean_text`)
Web Related	Details on the `ml_things` web related functions (download_from)
Snippets	Curated list of Python snippets I frequently use.
Comments	Sample on how I like to comment my code. It is still a work in progress.
Notebooks Tutorials	Machine learning projects that I converted to tutorials and posted online.
Final Note	Being grateful.

nivu’s AI All Resources

Tagline

A curated list of Best Artificial Intelligence Resources

Repository

https://github.com/nivu/ai_all_resources

Example Neural Network Architecture Collections

Nets4Learning

Tagline

Web platform for the design and execution of deep learning models for learning and initiation in the study of deep learning models.

Repository

https://github.com/SIMIDAT/nets4learning

The tool proposes different classical machine learning problems with known data sets to study and model different neural network architectures and training parameters. The tool addresses different examples of deep learning models such as tabular classification, image classifier or object identification.

There are some classical problems prepared and reviewed to make predictions, the tool has the feature to preprocess data sets that the user uploads, train models and predict in which class it would be classified.

Flux Model Zoo

Tagline

Please do not feed the models

Repository

https://github.com/FluxML/model-zoo

This repository contains various demonstrations of the Flux machine learning library. Any of these may freely be used as a starting point for your own models.

The models are broadly categorised into the folders vision (e.g. large convolutional neural networks (CNNs)), text (e.g. various recurrent neural networks (RNNs) and natural language processing (NLP) models), games (Reinforcement Learning / RL). See the READMEs of respective models for more information.

Note that Flux is a library written for the Julia programming langauge (i.e. not Python).

Recommended Websites

Articles

distill

Description

Link

https://distill.pub/

Selected Articles

Attention and Augmented Recurrent Neural Networks

Link

https://distill.pub/2016/augmented-rnns/

How to Use t-SNE Effectively

Link

https://distill.pub/2016/misread-tsne/

Although extremely useful for visualizing high-dimensional data, t-SNE plots can sometimes be mysterious or misleading. By exploring how it behaves in simple cases, we can learn to use it more effectively.

Feature Visualization

Link

https://distill.pub/2017/feature-visualization/

Sequence Modeling with CTC

Tagline

A visual guide to Connectionist Temporal Classification, an algorithm used to train deep neural networks in speech recognition, handwriting recognition and other sequence problems.

Link

https://distill.pub/2017/ctc/

The Building Blocks of Interpretability

Link

https://distill.pub/2018/building-blocks/

Interpretability techniques are normally studied in isolation. We explore the powerful interfaces that arise when you combine them — and the rich structure of this combinatorial space. For instance, by combining feature visualization (what is a neuron looking for?) with attribution (how does it affect the output?), we can explore how the network decides between labels like Labrador retriever and tiger cat.

Feature-Wise Transformations

Tagline

A simple and surprisingly effective family of conditioning mechanisms.

Link

https://distill.pub/2018/feature-wise-transformations/

Many real-world problems require integrating multiple sources of information. Sometimes these problems involve multiple, distinct modalities of information — vision, language, audio, etc. — as is required to understand a scene in a movie or answer a question about an image. Other times, these problems involve multiple sources of the same kind of input, i.e. when summarizing several documents or drawing one image in the style of another.

Exploring Neural Networks with Activation Atlases

Link

By using feature inversion to visualize millions of activations from an image classification network, we create an explorable activation atlas of features the network has learned which can reveal how the network typically represents some concepts.

Visualizing memorization in RNNs

Tagline

Inspecting gradient magnitudes in context can be a powerful tool to see when recurrent units use short-term or long-term contextual understanding.

Link

https://distill.pub/2019/memorization-in-rnns/

This article presents a qualitative visualization method for comparing recurrent units with regards to memorization and contextual understanding. The method is applied to the three recurrent units mentioned above: Nested LSTMs, LSTMs, and GRUs.

Paths Perspective on Value Learning

Tagline

A closer look at how Temporal Difference learning merges paths of experience for greater statistical efficiency.

Link

https://distill.pub/2019/paths-perspective-on-value-learning/

::![](assets/2023-12-11-21-42-47.png)

Computing Receptive Fields of Convolutional Neural Networks

<div class="infobox-row"><div class="infobox-head">Tagline</div><div class="infobox-spacer"></div><div class="infobox-tail">Mathematical derivations and <a href="https://github.com/google-research/receptive_field">open-source library</a> to compute receptive fields of convnets, enabling the mapping of extracted features to input signals.</div></div>

<div class="infobox-row"><div class="infobox-head">Link</div><div class="infobox-spacer"></div><div class="infobox-tail"><a href="https://distill.pub/2019/computing-receptive-fields/">https://distill.pub/2019/computing-receptive-fields/</a></div></div>

Visualizing Neural Networks with the Grand Tour

Tagline

Using Visualization to Understand DNNs

Link

https://distill.pub/2020/grand-tour/

The Grand Tour is a classic visualization technique for high-dimensional point clouds that projects a high-dimensional dataset into two dimensions. Over time, the Grand Tour smoothly animates its projection so that every possible view of the dataset is (eventually) presented to the viewer. …

In this article, we show how to leverage the linearity of the Grand Tour to enable a number of capabilities that are uniquely useful to visualize the behavior of neural networks.

Concretely, we present three use cases of interest:

visualizing the training process as the network weights change,

visualizing the layer-to-layer behavior as the data goes through the network[,] and

visualizing both how adversarial examples are crafted and how they fool a neural network.

Understanding RL (Reinforcement Learning) Vision

Tagline

With diverse environments, we can analyze, diagnose and edit deep reinforcement learning models using attribution.

Link

https://distill.pub/2020/understanding-rl-vision/

Demos and Playgrounds

Attention Visualization

AttViz

Tagline

Self attention made simple; Dissecting Transformers via attention visualization

Live demo

http://attviz.ijs.si/

Repository

https://github.com/SkBlaz/attviz

BertViz

Tagline

Visualize Attention in NLP Models

Live demo

See below

Repository

https://github.com/jessevig/bertviz

BertViz is an interactive tool for visualizing attention in Transformer language models such as BERT, GPT2, or T5. It can be run inside a Jupyter or Colab notebook through a simple Python API that supports most Huggingface models. BertViz extends the Tensor2Tensor visualization tool by Llion Jones, providing multiple views that each offer a unique lens into the attention mechanism.

The repository link contains information about usage as well as several links to interactive tutorial Colab notebooks.

attention-viz

Tagline

Visualizing query-key interactions in language + vision transformers

Live demo

https://attentionviz.com/

Documentation

https://catherinesyeh.github.io/attn-docs/

Repository

https://github.com/catherinesyeh/attention-viz

Attention Viz is an interactive tool that visualizes global attention patterns for transformer models. To create this tool, we visualize the joint embeddings of query and key vectors.

attentions

Tagline

An Apache 2.0 PyTorch implementation of some attentions for Deep Learning Researchers.

Repository

https://github.com/sooftware/attentions

Implementation List
Name	Citation
Additive attention	Bahdanau et al., 2015
Dot-product attention	Luong et al., 2015
Location-Aware (Location Sensitive) Attention	Chorowski et al., 2015
Scaled Dot-Product Attention	Vaswani et al., 2017
Multi-Head Attention	Vaswani et al., 2017
Relative Multi-Head Self Attention	ZihangDai et al., 2019

Other

Anomagram

Tagline

Interactive Visualization for Autoencoders with Tensorflow.js

Live Demo

https://anomagram.fastforwardlabs.com/#/

Repository

https://github.com/victordibia/anomagram

Anomagram is an interactive experience built with Tensorflow.js to demonstrate how deep neural networks (autoencoders) can be applied to the task of anomaly detection.

DRLViz

Tagline

Online demo of DRLViz, an interactive tool to understand decisions and memory in Deep Reinforcement Learning

Live Demo

https://sical.github.io/drlviz/

Repository

https://github.com/sical/drlviz

VisualML

Tagline

TODO

Live Demo

(multiple --see below)

Repository

https://github.com/dsgiitr/VisualML

Visual Machine Learning contains a set of Machine Learning and Deep Learning interactive visualisation demos for developing intuition. These demos are developed using TensorFlow.js and can be executed directly in your browser.

Live demo links:

RNN Explainer

Tagline

An interactive visualization application designed to help non-experts learn about Recurrent Neural Networks (RNNs).

Live demo

https://damien0x0023.github.io/rnnExplainer/

Repository

https://github.com/damien0x0023/rnnExplainer

Info

(Recommendation: “try this demo with a screen which is larger than 8 inches and has a minimum resolution of 1280x720”)

CNN Explainer

Tagline

Learning Convolutional Neural Networks with Interactive Visualization;
An interactive visualization system designed to help non-experts learn about Convolutional Neural Networks (CNNs)

Live demo

http://poloclub.github.io/cnn-explainer/

Repository

https://github.com/poloclub/cnn-explainer?tab=readme-ov-file

Diffusion Explainer

Tagline

Diffusion Explainer: Visual Explanation for Text-to-image Stable Diffusion

Live demo

http://poloclub.github.io/diffusion-explainer

Demo video

https://youtu.be/Zg4gxdIWDds

Research paper

https://arxiv.org/abs/2305.03509

Blog post

https://medium.com/@seongminleee/77b53f4f1c4

Repository

https://github.com/poloclub/diffusion-explainer

Wizmap

Tagline

Explore and interpret large embeddings in your browser with interactive visualization! 📍

Live demo

https://poloclub.github.io/wizmap

Demo video

https://youtu.be/8fJG87QVceQ

Research paper

https://arxiv.org/abs/2306.09328

Repository

https://github.com/poloclub/wizmap

The repository includes an interactive notebook containing instructions for using your own embeddings with WizMap.

WizMap is a scalable interactive visualization tool to help you easily explore large machine learning embeddings. With a novel multi-resolution embedding summarization method and a familiar map-like interaction design, WizMap allows you to navigate and interpret embedding spaces with ease.


DiffusionDB Prompts + Images	ACL Paper Abstracts	IMDB Review Comments
1.8M text + 1.8M images	63k text	25k text
`CLIP` Embedding	`all-MiniLM-L6-v2` Embedding	`all-MiniLM-L6-v2` Embedding

GanLab

Tagline

GAN Lab: An Interactive, Visual Experimentation Tool for Generative Adversarial Networks

Live demo

https://poloclub.github.io/ganlab/

Repository

https://github.com/poloclub/ganlab?tab=readme-ov-file

TimberTrek

Tagline

Explore and compare 1K+ accurate decision trees in your browser!

Live demo

https://poloclub.github.io/timbertrek

Demo video

https://youtu.be/3eGqTmsStJM

Conference Talk

https://youtu.be/l1mr9z1TuAk

Research paper

https://arxiv.org/abs/2209.09227

Colab notebook

https://github.com/ubc-systopia/treeFarms/blob/main/treefarms/tutorial.ipynb

Repository

https://github.com/poloclub/timbertrek

GAM Coach

Tagline

An interactive tool to help everyday users discover actionable strategies to obtain desired AI decisions.

Live demo

https://poloclub.github.io/gam-coach/

Demo video

https://youtu.be/ubacP34H9XE

Research paper

https://arxiv.org/abs/2302.14165

Repository

https://github.com/poloclub/gam-coach

Interactive Classification

Tagline

Interactive Classification for Deep Learning Interpretation

Live demo

https://cabreraalex.github.io/interactive-classification/

Repository

https://github.com/poloclub/interactive-classification

The live demo includes a “tour”-style tutorial.

Interactive Classification allows you to explore how computers see by modifying images.

WebSHAP

Tagline

JavaScript library to explain any machine learning models anywhere!

Live Demo

(See below.)

Library documentation

http://poloclub.github.io/webshap/doc

Research paper

https://arxiv.org/abs/2303.09545

Repository

https://github.com/poloclub/webshap

Live Demo List (see Repository README.md for more info)

Financial ML model for predictive classification

Convolutional NN for image classification

Transformer-based text classifier

Bluff

Tagline

Bluff: Interactively Deciphering Adversarial Attacks on Deep Neural Networks

Live Demo

https://poloclub.github.io/bluff/

Demo video

https://youtu.be/b6PHyYnassc

Research paper

https://arxiv.org/pdf/2009.02608.pdf

Repository

https://github.com/poloclub/bluff

Dodrio

Tagline

Exploring attention weights in transformer-based models with linguistic knowledge.

Live demo

http://poloclub.github.io/dodrio/

Research paper

https://arxiv.org/abs/2103.14625

Repository

https://github.com/poloclub/dodrio

An interactive visualization system designed to help NLP researchers and practitioners analyze and compare attention weights in transformer-based models with linguistic knowledge.

Neuro-Cartography

Tagline

Scalable Automatic Visual Summarization of Concepts in Deep Neural Networks

Live demo

https://poloclub.github.io/neuro-cartography/

Demo video

https://youtu.be/gx0dDNXFJA0

Research paper

https://arxiv.org/abs/2108.12931

Repository

https://github.com/poloclub/neuro-cartography

Visual Auditor

Tagline

Interactive scalable auditing of model biases and vulnerabilities with interpretable mitigation;
An interactive visualization system for identifying and understanding biases in machine learning models.

Live demo

https://visual-auditor.surge.sh/

Repository

https://github.com/poloclub/visual-auditor

TeleGam

Tagline

TeleGam: Combining Visualization and Verbalization for Interpretable Machine Learning

Live demo

https://poloclub.github.io/telegam/

Demo video

https://youtu.be/DKeyhVIABlQ

Repository

https://github.com/poloclub/telegam

TeleGam is a prototype system that demonstrates how visualizations and verbalizations can collectively support interactive interpretation of machine learning models, for example, generalized additive models (GAMs).

Convolution Sandbox

:: Tagline:

Live demo

https://antoinebrl.github.io/blog/conv1d/

Repository

https://github.com/antoinebrl/convolution1d-sandbox

Convolutions are core to deep learning recent success, especially in computer vision. This interactive visualization help to grasp a better understanding of the step-by-step processing.

User can select different kernels and input signals among the predefined functions. Another option drag the dots to the wanted level. The app also illustrates the importance of the padding, the dilatation and stride parameters.

Additional Resource Collections

Machine Learning Tokyo’s Interactive Tools

Repository

https://github.com/Machine-Learning-Tokyo/Interactive_Tools

A small sampling of the contents:

geosci.ai

Tagline

This mini-site hosts a series of experiments in artificial intelligence in the field of geoscience.

Link

https://geosci.ai/

Contains 9 interactive applications and utilities involving geoscience. About half of them are specifically AI-related as well.

Other Interactive Resource Collections

https://github.com/stared/interactive-machine-learning-list

Topic-Specific Resources

Classification and Regression

Deep Learning and NN Architecture

Transformers

ashishpatel26’s Treasure of Transformers

Tagline

💁 Awesome Treasure of Transformers Models for Natural Language processing contains papers, videos, blogs, official repo along with colab Notebooks. 🛫☑️

Repository

https://github.com/ashishpatel26/Treasure-of-Transformers

Specific ML Frameworks

PyTorch

PyTorch 101 Tutorial Series

Link

https://blog.paperspace.com/pytorch-101-understanding-graphs-and-automatic-differentiation

Repository

https://github.com/Paperspace/PyTorch-101-Tutorial-Series

The repository link contains interactive notebooks corresponding to each blog post in the series.

Series Contents:

bharathgs’ Awesome PyTorch List

Tagline

A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.

Repository

https://github.com/bharathgs/Awesome-pytorch-list

TensorFlow (+Keras)

Hands-On Machine Learning…

Tagline

Repository

https://github.com/Akramz/Hands-on-Machine-Learning-with-Scikit-Learn-Keras-and-TensorFlow

Course Material

MIT Deep Learning

Tagline

Tutorials, assignments, and competitions for MIT Deep Learning related courses.

Repository

https://github.com/lexfridman/mit-deep-learning

Contains links to lecture videos and several Jupyter notebooks and Google Colab notebooks for various task-oriented tutorials.

ML Frameworks

C/C++ Frameworks

DeepC

Tagline

vendor independent TinyML deep learning library, compiler and inference framework microcomputers and micro-controllers

Repository

https://github.com/ai-techsystems/deepC

Repository README.md contains links to colab notebook and other reference material.

The deepC is a vendor independent deep learning library, compiler and inference framework designed for small form-factor devices including μControllers, IoT and Edge devices.

Python Frameworks

OpenPrompt

Tagline

An Open-Source Framework for Prompt-Learning.

Repository

https://github.com/thunlp/OpenPrompt

Julia Frameworks

Flux

Tagline

The Elegant Machine Learning Stack

Website

https://fluxml.ai/

Repository

https://github.com/FluxML/Flux.jl

Flux is a 100% pure-Julia stack and provides lightweight abstractions on top of Julia’s native GPU and AD support. It makes the easy things easy while remaining fully hackable.

MLJ

Tagline

A Julia machine learning framework

Learning guides

https://alan-turing-institute.github.io/MLJ.jl/dev/learning_mlj/

Documentation

https://alan-turing-institute.github.io/MLJ.jl/dev/

Cheatsheet

https://alan-turing-institute.github.io/MLJ.jl/dev/mlj_cheatsheet/

Repository

https://github.com/alan-turing-institute/MLJ.jl

MLJ (Machine Learning in Julia) is a toolbox written in Julia providing a common interface and meta-algorithms for selecting, tuning, evaluating, composing and comparing about 200 machine learning models written in Julia and other languages.

Focus is mainly not on Deep Learning techniques.

Other Languages

Framework Interop

Reinforcement Learning initiatives

DeepRTS

Tagline

A Real-Time-Strategy game for Deep Learning research

Repository

https://github.com/cair/deep-rts

DeepRTS is a high-performance Real-TIme strategy game for Reinforcement Learning research. It is written in C++ for performance, but provides an python interface to better interface with machine-learning toolkits. Deep RTS can process the game with over 6 000 000 steps per second and 2 000 000 steps when rendering graphics. In comparison to other solutions, such as StarCraft, this is over 15 000% faster simulation time running on Intel i7-8700k with Nvidia RTX 2080 TI.

The aim of Deep RTS is to bring a more affordable and sustainable solution to RTS AI research by reducing computation time.

Google Deepmind’s `lab`

Tagline

A customisable 3D platform for agent-based AI research

Repository

https://github.com/google-deepmind/lab

DeepMind Lab is a 3D learning environment based on id Software’s Quake III Arena via ioquake3 and other open source software.

DeepMind Lab provides a suite of challenging 3D navigation and puzzle-solving tasks for learning agents. Its primary purpose is to act as a testbed for research in artificial intelligence, especially deep reinforcement learning.

(Click the images below to watch each of the three demo videos on YouTube.)

DeepMind Lab - Laser Tag Space Bounce Level (Hard)

Gymnasium

Tagline

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities

Documentation

https://gymnasium.farama.org/

Repository

https://github.com/Farama-Foundation/Gymnasium

Successor to OpenAI’s Gym (website, repository).

Gymnasium includes the following families of environments along with a wide variety of third-party environments

Classic Control - These are classic reinforcement learning based on real-world problems and physics.

Box2D - These environments all involve toy games based around physics control, using box2d based physics and PyGame-based rendering

Toy Text - These environments are designed to be extremely simple, with small discrete state and action spaces, and hence easy to learn. As a result, they are suitable for debugging implementations of reinforcement learning algorithms.

MuJoCo - A physics engine based environments with multi-joint control which are more complex than the Box2D environments.

Atari - A set of 57 Atari 2600 environments simulated through Stella and the Arcade Learning Environment that have a high range of complexity for agents to learn.

Third-party - A number of environments have been created that are compatible with the Gymnasium API. Be aware of the version that the software was created for and use the apply_env_compatibility in gymnasium.make if necessary.

ML Libraries

AutoML

Python

Optuna

Tagline

A hyperparameter optimization framework

Documentation

https://optuna.readthedocs.io/en/stable/

Optuna is an automatic hyperparameter optimization software framework, particularly designed for machine learning. It features an imperative, define-by-run style user API. Thanks to our define-by-run API, the code written with Optuna enjoys high modularity, and the user of Optuna can dynamically construct the search spaces for the hyperparameters.

Optuna has modern functionalities as follows:

Lightweight, versatile, and platform agnostic architecture

Handle a wide variety of tasks with a simple installation that has few requirements.

Pythonic search spaces

*Define search spaces using familiar Python syntax including conditionals and loops.*

Efficient optimization algorithms

Adopt state-of-the-art algorithms for sampling hyperparameters and efficiently pruning unpromising trials.

Easy parallelization

Scale studies to tens or hundreds of workers with little or no changes to the code.

Quick visualization

Inspect optimization histories from a variety of plotting functions.

Optuna is popular and is generally regarded as accessible to beginners.

Hyperopt

Tagline

Distributed Asynchronous Hyper-parameter Optimization

Documentation

https://hyperopt.github.io/hyperopt/

Repository

https://github.com/hyperopt/hyperopt

NNI

Tagline

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

Documentation

https://nni.readthedocs.io/

Repository

https://github.com/microsoft/nni

NNI automates feature engineering, neural architecture search, hyperparameter tuning, and model compression for deep learning. Find the latest features, API, examples and tutorials in our official documentation.

The documentation has several tutorials and quick-start guides for a variety of situations, but its coverage for nonstandard operations is less than thorough (and in some places is outdated and/or self-contradictory).

Auto-PyTorch

Tagline

Automatic architecture search and hyperparameter optimization for PyTorch

Documentation

https://automl.github.io/Auto-PyTorch/master

Repository

https://github.com/automl/Auto-PyTorch

While early AutoML frameworks focused on optimizing traditional ML pipelines and their hyperparameters, another trend in AutoML is to focus on neural architecture search. To bring the best of these two worlds together, we developed Auto-PyTorch, which jointly and robustly optimizes the network architecture and the training hyperparameters to enable fully automated deep learning (AutoDL).

Auto-PyTorch is mainly developed to support tabular data (classification, regression) and time series data (forecasting). The newest features in Auto-PyTorch for tabular data are described in the paper “Auto-PyTorch Tabular: Multi-Fidelity MetaLearning for Efficient and Robust AutoDL” (see below for bibtex ref). Details about Auto-PyTorch for multi-horizontal time series forecasting tasks can be found in the paper “Efficient Automated Deep Learning for Time Series Forecasting”.

FLAML

Tagline

A fast library for AutoML and tuning

Website

https://microsoft.github.io/FLAML/

Discord server

https://discord.gg/Cppx2vSPVP

Repository

https://github.com/microsoft/FLAML

FLAML is a lightweight Python library for efficient automation of machine learning and AI operations. It automates workflow based on large language models, machine learning models, etc. and optimizes their performance.

FLAML enables building next-gen GPT-X applications based on multi-agent conversations with minimal effort. It simplifies the orchestration, automation and optimization of a complex GPT-X workflow. It maximizes the performance of GPT-X models and augments their weakness.

For common machine learning tasks like classification and regression, it quickly finds quality models for user-provided data with low computational resources. It is easy to customize or extend. Users can find their desired customizability from a smooth range.

It supports fast and economical automatic tuning (e.g., inference hyperparameters for foundation models, configurations in MLOps/LMOps workflows, pipelines, mathematical/statistical models, algorithms, computing experiments, software configurations), capable of handling large search space with heterogeneous evaluation cost and complex constraints/guidance/early stopping.

Featuretools

Tagline

An open source python framework for automated feature engineering

Website

https://www.featuretools.com/

Featuretools automatically creates features from temporal and relational datasets.

DeepHyper

Tagline

Distributed Neural Architecture and Hyperparameter Optimization for Machine Learning

Website

https://deephyper.readthedocs.io/en/latest/

DeepHyper is a powerful Python package for automating machine learning tasks, particularly focused on optimizing hyperparameters, searching for optimal neural architectures, and quantifying uncertainty through the deep ensembles. With DeepHyper, users can easily perform these tasks on a single machine or distributed across multiple machines, making it ideal for use in a variety of environments. Whether you’re a beginner looking to optimize your machine learning models or an experienced data scientist looking to streamline your workflow, DeepHyper has something to offer. So why wait? Start using DeepHyper today and take your machine learning skills to the next level!

AutoGluon

Tagline

AutoML for Image, Text, Time Series, and Tabular Data

Website

https://auto.gluon.ai/stable/index.html

See the website for several Quick Start guides and tutorials.

AdaNet

Tagline

AdaNet is a TensorFlow framework for fast and flexible AutoML with learning guarantees.

Website

https://adanet.readthedocs.io/en/v0.9.0/

AdaNet is a lightweight TensorFlow-based framework for automatically learning high-quality models with minimal expert intervention. AdaNet builds on recent AutoML efforts to be fast and flexible while providing learning guarantees. Importantly, AdaNet provides a general framework for not only learning a neural network architecture, but also for learning to ensemble to obtain even better models.

Neuraxio

Tagline

Code Machine Learning Pipelines - The Right Way.

Repository

https://github.com/Neuraxio/Neuraxle

Quote

The world’s cleanest AutoML library ✨ - Do hyperparameter tuning with the right pipeline abstractions to write clean deep learning production pipelines. Let your pipeline steps have hyperparameter spaces. Design steps in your pipeline like components. Compatible with Scikit-Learn, TensorFlow, and most other libraries, frameworks and MLOps environments.

Time series

Python

Temporian

Tagline

Temporian is to temporal data what Pandas is to tabular data.

Website

https://temporian.readthedocs.io/en/stable/

Temporian is a library for safe, simple and efficient preprocessing and feature engineering of temporal data in Python. Temporian supports multivariate time-series, multivariate time-sequences, event logs, and cross-source event streams.

functime

Tagline

Production-ready time series models

Website

https://www.functime.ai/

Tutorial

https://docs.functime.ai/forecasting/

Documentation

https://docs.functime.ai/

functime is a machine learning library for time-series predictions that just works.

Fully-featured: Powerful and easy-to-use API for forecasting and feature engineering (tsfresh, Catch22).

Fast: Forecast 100,000 time series in seconds on your laptop

Efficient: Extract 100s of time-series features in parallel using Polars

Battle-tested: Algorithms that deliver real business impact and win competitions

tsflex

Tagline

flexible time-series operations

Website

https://predict-idlab.github.io/tsflex/

tsflex … [is] a sequence first Python toolkit for processing & feature extraction, making few assumptions about input data. This makes tsflex suitable for use-cases such as inference on streaming data, performing operations on irregularly sampled series, a holistic approach for operating on multivariate asynchronous data, and dealing with time-gaps.

GluonTS

Tagline

GluonTS is a Python package for probabilistic time series modeling, focusing on deep learning based models, based on PyTorch and MXNet.

Website

https://ts.gluon.ai/stable/index.html

PyTorch-Forecasting

Website

https://pytorch-forecasting.readthedocs.io/en/stable/

Explainable DL

Python

Logic Explained Networks

Tagline

Logic Explained Networks is a python repository implementing explainable-by-design deep learning models.

Repository

https://github.com/pietrobarbiero/logic_explained_networks

The Logic Explained Network is a python repository providing a set of utilities and modules to build deep learning models that are explainable by design. This library provides both already implemented LENs classes and APIs classes to get First-Order Logic (FOL) explanations from neural networks.

Streaming ML

River

Tagline

River is a Python library for online machine learning. It aims to be the most user-friendly library for doing machine learning on streaming data. River is the result of a merger between creme and scikit-multiflow.

Website

https://riverml.xyz/latest/

Info

See also deep-river at https://github.com/online-ml/deep-river: “deep-river is a Python library for online deep learning. deep-river’s ambition is to enable online machine learning for neural networks. It combines the river API with the capabilities of designing neural networks based on PyTorch.”

Plotting and Data Visualization

Python

`seaborn`

Website

https://seaborn.pydata.org

Showcase

https://seaborn.pydata.org/examples/index.html

Tutorial

https://seaborn.pydata.org/tutorial.html

Perspective

Website

https://perspective.finos.org/

Perspective is an interactive analytics and data visualization component, which is especially well-suited for large and/or streaming datasets. Use it to create user-configurable reports, dashboards, notebooks and applications, then deploy stand-alone in the browser, or in concert with Python and/or Jupyterlab.

`scikit-learn` alternatives

PyCaret

Tagline

PyCaret is an open-source, low-code machine learning library in Python that automates machine learning workflows.

Website

https://pycaret.org/

Architecture-Specific

Transformers

Python

HuggingFace `transformers`

Tagline

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Documentation

https://huggingface.co/transformers

Repository

https://github.com/huggingface/transformers

See also this list of “Awesome projects built with transformers.”

NLP

Python

spaCy

Tagline

Industrial-strength NLP

"spaCy 101"

https://spacy.io/usage/spacy-101

Online course

https://course.spacy.io/

Videos

https://www.youtube.com/c/ExplosionAI

Usage guides

https://spacy.io/usage/

Project Templates

https://github.com/explosion/projects

Models

https://spacy.io/models

API Docs

https://spacy.io/api/

Universe

https://spacy.io/universe

VS Code Ext.

https://github.com/explosion/spacy-vscode

Website

https://spacy.io/

Examples

https://github.com/explosion/spaCy/tree/master/examples

Repository

https://github.com/explosion/spaCy

spaCy is a library for advanced Natural Language Processing in Python and Cython. It’s built on the very latest research, and was designed from day one to be used in real products.

spaCy comes with pretrained pipelines and currently supports tokenization and training for 70+ languages. It features state-of-the-art speed and neural network models for tagging, parsing, named entity recognition, text classification and more, multi-task learning with pretrained transformers like BERT, as well as a production-ready training system and easy model packaging, deployment and workflow management. spaCy is commercial open-source software, released under the MIT license.

Appears to use PyTorch for GPU support.

FARM (Framework for Adapting Representation Models)

Tagline

🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

Documentation

https://farm.deepset.ai/

Repository

https://github.com/deepset-ai/FARM

Contextualized Topic Models

Tagline

A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.

Repository

https://github.com/MilaNLProc/contextualized-topic-models

Contextualized Topic Models (CTM) are a family of topic models that use pre-trained representations of language (e.g., BERT) to support topic modeling. See the papers for details:

Bianchi, F., Terragni, S., & Hovy, D. (2021). Pre-training is a Hot Topic: Contextualized Document Embeddings Improve Topic Coherence. ACL. https://aclanthology.org/2021.acl-short.96/

Bianchi, F., Terragni, S., Hovy, D., Nozza, D., & Fersini, E. (2021). Cross-lingual Contextualized Topic Models with Zero-shot Learning. EACL. https://www.aclweb.org/anthology/2021.eacl-main.143/

Our new topic modeling family supports many different languages (i.e., the one supported by HuggingFace models) and comes in two versions: CombinedTM combines contextual embeddings with the good old bag of words to make more coherent topics; ZeroShotTM is the perfect topic model for task in which you might have missing words in the test data and also, if trained with multilingual embeddings, inherits the property of being a multilingual topic model!

The big advantage is that you can use different embeddings for CTMs. Thus, when a new embedding method comes out you can use it in the code and improve your results. We are not limited by the BoW anymore.

An important aspect to take into account is which network you want to use: the one that combines contextualized embeddings and the BoW (CombinedTM) or the one that just uses contextualized embeddings (ZeroShotTM).

But remember that you can do zero-shot cross-lingual topic modeling only with the ZeroShotTM model.

Contextualized Topic Models also support supervision (SuperCTM).

We also have Kitty: a new submodule you can use to create a human-in-the-loop classifier to quickly classify your documents and create named clusters. This can be very useful to do document filtering. It also works in cross-lingual setting and thus you might be able to filter documents in a language you don’t know!

Repository README.md includes links to four Google Colab tutorial notebooks.

CUDA GPU support via PyTorch.

skweak

Tagline

A software toolkit for weak supervision applied to NLP tasks

Repository

https://github.com/NorskRegnesentral/skweak

Labelled data remains a scarce resource in many practical NLP scenarios. This is especially the case when working with resource-poor languages (or text domains), or when using task-specific labels without pre-existing datasets. The only available option is often to collect and annotate texts by hand, which is expensive and time-consuming.

skweak (pronounced /skwi:k/) is a Python-based software toolkit that provides a concrete solution to this problem using weak supervision. skweak is built around a very simple idea: Instead of annotating texts by hand, we define a set of labelling functions to automatically label our documents, and then aggregate their results to obtain a labelled version of our corpus.

The labelling functions may take various forms, such as domain-specific heuristics (like pattern-matching rules), gazetteers (based on large dictionaries), machine learning models, or even annotations from crowd-workers. The aggregation is done using a statistical model that automatically estimates the relative accuracy (and confusions) of each labelling function by comparing their predictions with one another.

skweak can be applied to both sequence labelling and text classification, and comes with a complete API that makes it possible to create, apply and aggregate labelling functions with just a few lines of code. The toolkit is also tightly integrated with SpaCy, which makes it easy to incorporate into existing NLP pipelines. Give it a try!

Medical Imaging

Python

MedicalZooPytorch

Tagline

A pytorch-based deep learning framework for multi-modal 2D/3D medical image segmentation;
A 3D multi-modal medical image segmentation library in PyTorch

Repository

https://github.com/black0017/MedicalZooPytorch

Includes quick-start guide and Colab tutorial notebook.

Dev Tools

NN Design Software

Cerbrec Graphbook

Tagline

AI Modeling, Made Intuitive;
The diagramming platform that allows data scientists to focus on model architecture

Documentation

https://cerbrec.com/documentation

YouTube channel

https://www.youtube.com/@Graphbook

Slack community

https://cerbrec-community.slack.com/ssb/redirect

Repository

https://github.com/cerbrec/graphbook

Graphbook is a new visual IDE for AI and deep learning model development that lets you build and run directly on a visualization. For example, you can customize transformers directly in the platform, train, and serve them to a URL. Graphbook is still in beta mode and we are developing more models and product features over time.

The Github repository contains a number of NLP models, grouped into the following categories:

Classifiers
Generators
Next Token
Tokenizers
Transformers

There is also a small Community Gallery.

A Python scripting interface (“PyGraphbook”) is in progress.

NN Visualization Tools

nn_vis

Website

https://github.com/julrog/nn_vis

Collection of NN Architecture Visualization Tools

Tagline

Tools to Design or Visualize Architecture of Neural Network

Repository

https://github.com/ashishpatel26/Tools-to-Design-or-Visualize-Architecture-of-Neural-Network

NN Architecture “Zoos”

modelzoo.co

Link

https://modelzoo.co/

Models are grouped by framework (ex. PyTorch, TensorFlow) as well as by category (Computer Vision, NLP, RL, etc).

Reinforcement Learning Misc.

DeepRTS

Misc. Tools

(UNDER CONSTRUCTION)

Data Science Tools

Notebook Tools

Quarto

Website

https://www.quarto.org

Diagrams

Markup Languages

D2

svgbob

Website

https://ivanceras.github.io/svgbob-editor/

Svgbob is a diagramming model which uses a set of typing characters to approximate the intended shape. It uses a combination of characters which are readily available on your keyboards.

What can it do?

Basic shapes

Quick logo scribbles

Even unicode box drawing characters are supported

Circle, quarter arcs, half circles, 3/4 quarter arcs

Grids

Graphics Diagram

CJK characters

Sequence Diagrams

Plot diagrams

Railroad diagrams

Statistical charts

Flow charts

Block diagrams

Mindmaps

It can do complex stuff such as circuit diagrams

Latest addition: Styling of tagged shapes

Diagram Tools

text-to-diagram.com

Kroki

Domain-Specific Tools

DeepBrain (for medical imaging)

Tagline

Deep Learning tools for brain medical images

Repository

https://github.com/iitzco/deepbrain

Experiment and Data-Collection Tools

labgraph

Tagline

LabGraph is a streaming framework built by the Facebook Reality Labs Research team at Facebook.

Intro. Article

https://tech.fb.com/bci-milestone-new-research-from-ucsf-with-support-from-facebook-shows-the-potential-of-brain-computer-interfaces-for-restoring-speech-communication/

Documenation

https://github.com/facebookresearch/labgraph/blob/main/docs/index.md

Repository

https://github.com/facebookresearch/labgraph

LabGraph is a Python framework for rapidly prototyping experimental systems for real-time streaming applications. It is particularly well-suited to real-time neuroscience, physiology and psychology experiments.

Appendix A: Annotation and Notekeeping

Collaborative web annotation

There are probably others, but here is what I have found so far.

hypothes.is

Tagline

Collaborate with anyone, anywhere. Use Hypothesis to annotate anything online with classmates, colleagues, or friends.

Website

https://web.hypothes.is/

Repository

https://github.com/hypothesis

Roadmap

https://web.hypothes.is/product-update/

Many notekeeping options (see below) have plugins for integration with Hypothesis.

Memex

Tagline

Organise, annotate & discuss websites, PDFs, YouTube videos without leaving your tab. By yourself, with your peers & enhanced by AI

Website

https://memex.garden/

See the website for a demo video for researchers.

Hierarchical note-taking with backlinks and cross-references

Primarily Offline Interface

These offer limited real-time collaboration but don’t have the same lock-in effect (so you aren’t constrained to stay with any particular system, service, or app). Furthermore, unlike the systems that are primarily online services, users can create plugins for them.

Obsidian

Tagline

Obsidian is the private and flexible note‑taking app that adapts to the way you think.

Price

Free; pay for sync hosting (pricing here)

File format

Markdown (.md)

Plugins

https://obsidian.md/plugins

Website

https://obsidian.md/

Roadmap

https://obsidian.md/roadmap/

Supposedly more well-polished and full-featured than Logseq.

Has plugins (2000+) for Hypothesis integration, time-synced notes on audio files, automatic captioning, PDF annotation, embedded drawings, chemistry diagrams, integrated terminals, graphical chart editing, flashcards and spaced repetition, and more.

Anyone can develop their own plugins with the plugin API.

Has apps for every major platform.

Some additional information

Logseq

Tagline

A privacy-first, open-source knowledge base;
Connect your notes, increase understanding.

Price

Free; pay for sync hosting

File format

Markdown (.md)

Website

https://logseq.com/

Docs Hub

https://hub.logseq.com/

Repository

https://github.com/logseq/logseq

Has plugins (150+) for Hypothesis integration, time-synced notes on audio files, automatic captioning, PDF annotation, embedded drawings, chemistry diagrams, integrated terminals, graphical chart editing, flashcards and spaced repetition, and more.

Anyone can develop their own plugins with the plugin API.

Has apps for every major platform.

OneNote

Tagline

One cross-functional notebook for all your notetaking needs.

Price

included with Office365

File format

Proprietary

Website

https://www.onenote.com/

Product Overview

https://www.microsoft.com/en-us/microsoft-365/onenote/digital-note-taking-app

Perhaps more intuitive to use than Obsidian, and is part of the Microsoft ecosystem. However, there are fewer plugins (for integration with other services) and the notes cannot be edited with other programs.

Offers better real-time collaboration features than the other offline-oriented options.

Has apps for every major platform and a web interface.

Primarily/Exclusively Online

These offer effortless real-time collaboration (whereas collaboration with the above [except OneNote] would be more like pushing to a Git repository) but are generally pricier, lack robust offline access to notes, and have much higher lock-in (meaning it’s difficult, if not nearly impossible, to migrate to other note systems). Editing using other editors is generally not possible, although usually there are export options. (However, lots of exporting and importing is not convenient and also does not make use of the collaborative features.)

Notion

Tagline

Notion is the connected workspace where better, faster work happens.

Price

Free with limitations; [pricing here](https://www.notion.so/pricing)

File format

N/A (no offline interface; some export options)

Website

https://www.notion.so/product

Has apps for every major platform and a web app.

Roam Research

Tagline

A note-taking tool for networked thought.

Price

$15/month, $165/year; $500/5 years

File format

N/A (no offline interface; some export options)

Website

https://roamresearch.com/

Demo vids

https://roamresearch.com/#/app/help/page/2h9V-FtAO

As easy to use as a document. As powerful as a graph database.

Roam helps you organize your research for the long haul.

Also has apps for every major platform, as well as a web interface.

Saga.so

Tagline

Connected notes, tasks, and tools. For you and your team.

Price

Free with limitations ([pricing here](https://saga.so/pricing))

File format

N/A (no offline interface; some export options)

Website

https://saga.so/

Roadmap

https://sagahq.canny.io/

Desktop apps for Windows and Mac (no Linux, mobile, or web interface at the moment).

Markdown editors geared towards research/academia

These are compatible with systems that are based on Markdown (.md) files, such as Obsidian and Logseq. (Note that while the online-oriented systems typically also make use of Markdown syntax, transparent access to Markdown files is not available.)

Zettlr

Tagline

Your one-stop publication workbench

Price

Free

File format

Markdown (.md)

Website

https://www.zettlr.com/

Keyboard Ref

https://www.zettlr.com/shortcuts

Documentation

https://docs.zettlr.com/

Repository

https://github.com/Zettlr/Zettlr

From idea to publication in one app: Zettlr accompanies you while writing your blog post, newspaper article, term paper, thesis, or entire book.

Privacy First:

Zettlr is Privacy First: There is no forced cloud-synchronization and all files stay on your computer.

From Idea to Publication:

Manage all your writing projects from one app: From the initial idea to a final publication, already typeset in the appropriate template.

First-class Citation Support:

Hook Zettlr into your reference manager to have all your sources when you need them.

Academic literature and citation management

Reference Managers

Zotero

Tagline

Your personal research assistant: Zotero is a free, easy-to-use tool to help you collect, organize, annotate, cite, and share research.

Price

Free, with the option to buy additional cloud storage

Website

https://zotero.org

Repository

https://github.com/zotero/zotero

I’ve noticed that more note-keeping, citation management, and bibliography tools have integration options for Zotero than do for Mendeley. This leads me to believe that Zotero is more popular / widely used. This could also be due to observation bias since I am personally only familiar with Zotero.

Has apps for all major platforms. Not the prettiest per se, but has plenty of useful features.

Note also zoterobib, an officially-endorsed bibliography tool. (Not the same as Zotero, but made by the same development team.)

Mendeley

Price

Probably not free, because Elsevier.

Website

https://wwww.mendeley.com/guides/mendeley-reference-manager/

Appears to be a subset of Elsevier, as its homepage (<www.mendeley.com>) redirected me to the Elsevier website after a short delay.

Article Managers

ReadCube

Tagline

Transform how your team works with scholarly literature

Price

Pricing not listed, but definitely not free

Website

https://www.readcube.com

ReadCube’s literature management system helps businesses discover, organize, read, annotate, share, and cite research. Simplify your day-to-day so you’re free to make tomorrow’s discoveries.

The ReadCube website has a page where it provides comparisons of its services with those of Rightfind, Article Galaxy, Endnote, and Mendeley. I find it a bit suspicious that it doesn’t compare itself with Paperpile or any of the other popular similar services.

PaperPile

Tagline

The no-fuss reference manager for the web;
Manage your research library right in your browser.

Price

$2.99/mo ([pricing](https://paperpile.com/pricing/))

Website

https://paperpile.com/

See its Features page for an overview. (Categories: Manage References, Find and Collect, Organize PDFs, Highlight and annotate, Share and Collaborate, Cite in Google Docs).

Its Feature Roadmap is also available for viewing.

Advertises that it has a plugin for Microsoft Word and Google Docs. It also has mobile apps for Android/iOS.

Other

PaperMemory

Tagline

PaperMemory automatically records and organizes the papers you read, without ever leaving your browser.

Price

Free

Website

https://papermemory.org/

Repository

https://github.com/vict0rsch/PaperMemory

It is a browser extension.

Parse papers you open automatically.

Papers are stored in your Memory automatically, without a click. You can then search them, tag them, take personal notes etc.

Match preprints to publications

By querying SemanticScholar, DBLP and CrossRef, PaperMemory can discover the proper publication of Arxiv pre-prints.

You live in your browser? So do your papers

Share papers to your favorite apps by copying:

a BibTex entry for Overleaf

a Markdown link [title](url) link for Github, HackMD or Notion

a HyperText link for emails, Google Docs, Slack, etc.

Discover code repositories Using the PapersWithCode API, PaperMemory will match code repositories with papers in your Memory.

Enhance ArXiv.org

Display the actual pubication venue of published papers, a link to the code repository, copy the BibTex entry etc.

Instantly copy .bib-compatible bibliography entries Export a paper’s BibTex entry directly from the extension, or bulk export BibTex entries by paper tag. You can even use PaperMemory to update the ArXiv entries of a stand-alone .bib file.

Highly customizable

Change the theme to light or dark, control the default link copied to your clipboard, add links to SciRate / HuggingFace Papers / Ar5iv / ArxivSanity, trigger parsing manually, export / import papers etc.

And many more features! Github Gist synchronization, regex-based automatic paper tagging, arbitrary website parsing to record Blog posts or dataset websites, etc.

Visual and Interactive ML Resources