The Incredible PyTorch - Ritchie Ng

This is a curated list of tutorials, projects, libraries, videos, papers, books and anything related to the incredible PyTorch. Feel free to make a pull request to contribute to this list.

Table Of Contents
  • Table Of Contents
    • Tutorials
    • Large Language Models (LLMs)
    • Agentic AI
    • Guardrails and AI Safety
    • Tabular Data
    • Visualization
    • Explainability
    • Object Detection
    • Long-Tailed / Out-of-Distribution Recognition
    • Activation Functions
    • Energy-Based Learning
    • Missing Data
    • Architecture Search
    • Continual Learning
    • Optimization
    • Quantization
    • Quantum Machine Learning
    • Neural Network Compression
    • Facial, Action and Pose Recognition
    • Super resolution
    • Synthesizing Views
    • Voice
    • Medical
    • 3D Segmentation, Classification and Regression
    • Video Recognition
    • Recurrent Neural Networks (RNNs)
    • Convolutional Neural Networks (CNNs)
    • Segmentation
    • Geometric Deep Learning: Graph & Irregular Structures
    • Sorting
    • Ordinary Differential Equations Networks
    • Multi-task Learning
    • GANs, VAEs, and AEs
    • Unsupervised Learning
    • Adversarial Attacks
    • Style Transfer
    • Image Captioning
    • Transformers
    • Similarity Networks and Functions
    • Reasoning
    • General NLP
    • Question and Answering
    • Speech Generation and Recognition
    • Document and Text Classification
    • Text Generation
    • Text to Image
    • Translation
    • Sentiment Analysis
    • Deep Reinforcement Learning
    • Deep Bayesian Learning and Probabilistic Programming
    • Spiking Neural Networks
    • Anomaly Detection
    • Regression Types
    • Time Series
    • Synthetic Datasets
    • Neural Network General Improvements
    • DNN Applications in Chemistry and Physics
    • New Thinking on General Neural Network Architecture
    • Linear Algebra
    • API Abstraction
    • Low Level Utilities
    • PyTorch Utilities
    • PyTorch Video Tutorials
    • Community
    • To be Classified
    • Links to This Repository
    • Contributions
    • New Special Dedicated List to AI Agents: The Incredible AI Agents

Tutorials

  • Official PyTorch Tutorials
  • Official PyTorch Examples
  • Dive Into Deep Learning with PyTorch
  • How to Read Pytorch
  • Minicourse in Deep Learning with PyTorch (Multi-language)
  • Practical Deep Learning with PyTorch
  • Deep Learning Models
  • C++ Implementation of PyTorch Tutorial
  • Simple Examples to Introduce PyTorch
  • Mini Tutorials in PyTorch
  • Deep Learning for NLP
  • Deep Learning Tutorial for Researchers
  • Fully Convolutional Networks implemented with PyTorch
  • Simple PyTorch Tutorials Zero to ALL
  • DeepNLP-models-Pytorch
  • MILA PyTorch Welcome Tutorials
  • Effective PyTorch, Optimizing Runtime with TorchScript and Numerical Stability Optimization
  • Practical PyTorch
  • PyTorch Project Template
  • Semantic Search with PyTorch

Large Language Models (LLMs)

  • LLM Tutorials
    • Build a Large Language Model (From Scratch)
    • Huggingface LLM Training Book, a collection of methodologies to help with successful training of large language models
  • General
    • Starcoder 2, family of code generation models
    • GPT Fast, fast and hackable pytorch native transformer inference
    • Mixtral Offloading, run Mixtral-8x7B models in Colab or consumer desktops
    • Llama
    • Llama Recipes
    • TinyLlama
    • Mosaic Pretrained Transformers (MPT)
    • vLLM, high-throughput and memory-efficient inference and serving engine for LLMs
    • Dolly
    • Vicuna
    • Mistral 7B
    • BigDL LLM, library for running LLMs (large language models) on Intel XPU (from laptop to GPU to cloud) using INT4 with very low latency (for any PyTorch model)
    • Simple LLM Finetuner
    • Petals, run LLMs at home, BitTorrent-style, fine-tuning and inference up to 10x faster than offloading
    • Gemma, Google’s family of lightweight, state-of-the-art open models
    • Qwen, Alibaba Cloud’s large language model
    • CodeT5, code-aware encoder-decoder model for code understanding and generation
    • OpenLLaMA, permissively licensed open source reproduction of Meta AI’s LLaMA
    • RedPajama, leading open-source models with package to reproduce LLaMA training dataset
    • MosaicML LLM Foundry, codebase for training, finetuning, and deploying LLMs
  • Japanese
    • Japanese Llama
    • Japanese GPT Neox and Open Calm
  • Chinese
    • Chinese Llama-2 7B
    • Chinese Vicuna
  • Retrieval Augmented Generation (RAG)
    • LlamaIndex, data framework for your LLM application
  • Embeddings
    • ChromaDB, open-source embedding database
  • Applications
    • Langchain, building applications with LLMs through composability
    • LangSmith, platform for building production-grade LLM applications
    • LiteLLM, call all LLM APIs using the OpenAI format
    • OpenAI Python, official Python library for the OpenAI API
    • Guidance, library for controlling large language models
  • Finetuning
    • Huggingface PEFT, State-of-the-art Parameter-Efficient Fine-Tuning
    • Unsloth, finetune LLMs 2-5x faster with 80% less memory
    • LoRA, Low-Rank Adaptation of Large Language Models
    • QLoRA, efficient finetuning of quantized LLMs
    • Axolotl, tool designed to streamline the fine-tuning of various AI models
    • LLaMA-Factory, unified efficient fine-tuning of 100+ LLMs
  • Training
    • Higgsfield, Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters
    • DeepSpeed, deep learning optimization library
    • FairScale, PyTorch extensions for high performance and large scale training
    • Accelerate, simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
    • ColossalAI, unified deep learning system for large-scale model training and inference
  • Quantization
    • AutoGPTQ, easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm
    • BitsAndBytes, accessible large language models via k-bit quantization
    • GPTQ-for-LLaMa, 4-bit quantization of LLaMA using GPTQ
    • Optimum, acceleration of 🤗 Transformers and 🤗 Diffusers

Agentic AI

  • Multi-Agent Systems
    • LangGraph, library for building stateful, multi-actor applications with LLMs
    • AutoGen, library that enables the creation of applications using multiple agents that can converse with each other
    • CrewAI, framework for orchestrating role-playing, autonomous AI agents
    • MetaGPT, multi-agent framework for software company simulation
    • AgentScope, user-friendly multi-agent platform
    • Swarm, educational framework for building and deploying multi-agent systems
  • Autonomous Agents
    • AutoGPT, autonomous GPT-4 experiment to make GPT-4 fully autonomous
    • BabyAGI, example of an AI-powered task management system
    • LangChain Agents, building agents with LangChain
    • ReAct: Reasoning and Acting with Language Models
    • Voyager, open-ended embodied agent with large language models
  • Agent Orchestration and Frameworks
    • Semantic Kernel, lightweight SDK for integrating AI services
    • OpenAI Function Calling, tools for function calling with OpenAI models
    • LlamaIndex Agents, data agents with LlamaIndex
    • Haystack Agents, building search and QA agents
    • DSPy, framework for algorithmically optimizing LM prompts and weights
  • Planning and Reasoning
    • Tree of Thoughts, deliberate problem solving with large language models
    • ReWOO: Decoupling Reasoning from Observations for Efficient Augmented Language Models
    • Plan-and-Solve Prompting
  • Memory and Learning
    • MemGPT, creating LLM agents with long-term memory
    • Zep, fast, scalable building blocks for production LLM apps

Guardrails and AI Safety

  • Content Filtering and Moderation
    • Guardrails AI, framework for building reliable AI applications
    • NeMo Guardrails, toolkit for building trustworthy, safe and secure LLM applications
    • OpenAI Moderation API Tools
    • Detoxify, toxic comment classification using transformer models
    • Perspective API PyTorch Implementation, toxicity detection
  • Prompt Injection Defense
    • Prompt Injection Detector, detecting prompt injection attacks
    • LLM Guard, security toolkit for LLM interactions
    • Garak, LLM vulnerability scanner
  • Bias Detection and Mitigation
    • FairLearn, toolkit for assessing and improving fairness
    • AIF360, comprehensive set of fairness metrics and bias mitigation algorithms
    • What-If Tool, tool for analyzing and understanding ML models
  • Privacy and Security
    • Opacus, library for training PyTorch models with differential privacy
    • PySyft, secure and private Deep Learning framework
    • CrypTen, framework for Privacy Preserving Machine Learning
    • Adversarial Robustness Toolbox, library for adversarial attacks and defenses
  • Model Interpretability and Explainability
    • LIME, explaining the predictions of machine learning classifiers
    • SHAP, unified approach to explain the output of machine learning models
    • InterpretML, interpret and understand machine learning models
    • Alibi, algorithms for explaining machine learning models
  • Safety Evaluation and Testing
    • AI Safety Gym, environments and tools for AI safety research
    • Anthropic’s Constitutional AI implementations
    • HarmBench, standardized evaluation framework for automated red teaming

Tabular Data

  • PyTorch Frame: A Modular Framework for Multi-Modal Tabular Learning
  • PyTorch Tabular, standard framework for modelling deep learning models for tabular data
  • Tab Transformer
  • PyTorch-TabNet: Attentive Interpretable Tabular Learning
  • carefree-learn: A minimal Automatic Machine Learning (AutoML) solution for tabular datasets based on PyTorch

Visualization

  • Loss Visualization
  • Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
  • Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps
  • SmoothGrad: removing noise by adding noise
  • DeepDream: dream-like hallucinogenic visuals
  • FlashTorch: Visualization toolkit for neural networks in PyTorch
  • Lucent: Lucid adapted for PyTorch
  • DreamCreator: Training GoogleNet models for DeepDream with custom datasets made simple
  • CNN Feature Map Visualisation

Explainability

  • Neural-Backed Decision Trees
  • Efficient Covariance Estimation from Temporal Data
  • Hierarchical interpretations for neural network predictions
  • Shap, a unified approach to explain the output of any machine learning model
  • Visualizing PyTorch saved .pth deep learning models with Netron
  • Distilling a Neural Network Into a Soft Decision Tree
  • Captum, A unified model interpretability library for PyTorch

Object Detection

  • MMDetection Object Detection Toolbox
  • Mask R-CNN Benchmark: Faster R-CNN and Mask R-CNN in PyTorch 1.0
  • YOLO-World
  • YOLOS
  • YOLOF
  • YOLOX
  • YOLOv12: Attention-Centric Real-Time Object Detectors
  • YOLOv11
  • YOLOv10
  • YOLOv9
  • YOLOv8
  • Yolov7
  • YOLOv6
  • Yolov5
  • Yolov4
  • YOLOv3
  • YOLOv2: Real-Time Object Detection
  • SSD: Single Shot MultiBox Detector
  • Detectron models for Object Detection
  • Multi-digit Number Recognition from Street View Imagery using Deep Convolutional Neural Networks
  • Whale Detector
  • Catalyst.Detection

Long-Tailed / Out-of-Distribution Recognition

  • Distributionally Robust Neural Networks for Group Shifts: On the Importance of Regularization for Worst-Case Generalization
  • Invariant Risk Minimization
  • Training Confidence-Calibrated Classifier for Detecting Out-of-Distribution Samples
  • Deep Anomaly Detection with Outlier Exposure
  • Large-Scale Long-Tailed Recognition in an Open World
  • Principled Detection of Out-of-Distribution Examples in Neural Networks
  • Learning Confidence for Out-of-Distribution Detection in Neural Networks
  • PyTorch Imbalanced Class Sampler

Activation Functions

  • Rational Activations - Learnable Rational Activation Functions
  • FreeGrad, PyTorch library for custom backward passes, straight-through estimators and gradient transforms.

Energy-Based Learning

  • EBGAN, Energy-Based GANs
  • Maximum Entropy Generators for Energy-based Models

Missing Data

  • BRITS: Bidirectional Recurrent Imputation for Time Series

Architecture Search

  • EfficientNetV2
  • DenseNAS
  • DARTS: Differentiable Architecture Search
  • Efficient Neural Architecture Search (ENAS)
  • EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks

Continual Learning

  • Renate, Automatic Retraining of Neural Networks

Optimization

  • AccSGD, AdaBound, AdaMod, DiffGrad, Lamb, NovoGrad, RAdam, SGDW, Yogi and more
  • Lookahead Optimizer: k steps forward, 1 step back
  • RAdam, On the Variance of the Adaptive Learning Rate and Beyond
  • Over9000, Comparison of RAdam, Lookahead, Novograd, and combinations
  • AdaBound, Train As Fast as Adam As Good as SGD
  • Riemannian Adaptive Optimization Methods
  • L-BFGS
  • OptNet: Differentiable Optimization as a Layer in Neural Networks
  • Learning to learn by gradient descent by gradient descent
  • Surrogate Gradient Learning in Spiking Neural Networks
  • TorchOpt: An Efficient Library for Differentiable Optimization

Quantization

  • Additive Power-of-Two Quantization: An Efficient Non-uniform Discretization For Neural Networks

Quantum Machine Learning

  • Tor10, generic tensor-network library for quantum simulation in PyTorch
  • PennyLane, cross-platform Python library for quantum machine learning with PyTorch interface

Neural Network Compression

  • Bayesian Compression for Deep Learning
  • Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research
  • Learning Sparse Neural Networks through L0 regularization
  • Energy-constrained Compression for Deep Neural Networks via Weighted Sparse Projection and Layer Input Masking
  • EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis
  • Pruning Convolutional Neural Networks for Resource Efficient Inference
  • Pruning neural networks: is it time to nip it in the bud? (showing reduced networks work better)

Facial, Action and Pose Recognition

  • Facenet: Pretrained Pytorch face detection and recognition models
  • DGC-Net: Dense Geometric Correspondence Network
  • High performance facial recognition library on PyTorch
  • FaceBoxes, a CPU real-time face detector with high accuracy
  • How far are we from solving the 2D & 3D Face Alignment problem? (and a dataset of 230,000 3D facial landmarks)
  • Learning Spatio-Temporal Features with 3D Residual Networks for Action Recognition
  • PyTorch Realtime Multi-Person Pose Estimation
  • SphereFace: Deep Hypersphere Embedding for Face Recognition
  • GANimation: Anatomically-aware Facial Animation from a Single Image
  • Shufflenet V2 by Face++ with better results than paper
  • Towards 3D Human Pose Estimation in the Wild: a Weakly-supervised Approach
  • Unsupervised Learning of Depth and Ego-Motion from Video
  • FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks
  • FlowNet: Learning Optical Flow with Convolutional Networks
  • Optical Flow Estimation using a Spatial Pyramid Network
  • OpenFace in PyTorch
  • Deep Face Recognition in PyTorch

Super resolution

  • Enhanced Deep Residual Networks for Single Image Super-Resolution
  • Superresolution using an efficient sub-pixel convolutional neural network
  • Perceptual Losses for Real-Time Style Transfer and Super-Resolution

Synthesizing Views

  • NeRF, Neural Radiance Fields, Synthesizing Novel Views of Complex Scenes

Voice

  • Google AI VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking

Medical

  • Medical Zoo, 3D multi-modal medical image segmentation library in PyTorch
  • U-Net for FLAIR Abnormality Segmentation in Brain MRI
  • Genomic Classification via ULMFiT
  • Deep Neural Networks Improve Radiologists’ Performance in Breast Cancer Screening
  • Delira, lightweight framework for medical imaging prototyping
  • V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation
  • Medical Torch, medical imaging framework for PyTorch
  • TorchXRayVision - A library for chest X-ray datasets and models, including pre-trained models.

3D Segmentation, Classification and Regression

  • Kaolin, Library for Accelerating 3D Deep Learning Research
  • PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
  • 3D segmentation with MONAI and Catalyst

Video Recognition

  • Dancing to Music
  • Devil Is in the Edges: Learning Semantic Boundaries from Noisy Annotations
  • Deep Video Analytics
  • PredRNN: Recurrent Neural Networks for Predictive Learning using Spatiotemporal LSTMs

Recurrent Neural Networks (RNNs)

  • SRU: training RNNs as fast as CNNs
  • Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks
  • Averaged Stochastic Gradient Descent with Weight Dropped LSTM
  • Training RNNs as Fast as CNNs
  • Quasi-Recurrent Neural Network (QRNN)
  • ReSeg: A Recurrent Neural Network-based Model for Semantic Segmentation
  • A Recurrent Latent Variable Model for Sequential Data (VRNN)
  • Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks
  • Attention-Based Recurrent Neural Network Models for Joint Intent Detection and Slot Filling
  • Attentive Recurrent Comparators
  • Collection of Sequence to Sequence Models with PyTorch
    1. Vanilla Sequence to Sequence models
    2. Attention based Sequence to Sequence models
    3. Faster attention mechanisms using dot products between the final encoder and decoder hidden states

Convolutional Neural Networks (CNNs)

  • LegoNet: Efficient Convolutional Neural Networks with Lego Filters
  • MeshCNN, a convolutional neural network designed specifically for triangular meshes
  • Octave Convolution
  • PyTorch Image Models, ResNet/ResNeXT, DPN, MobileNet-V3/V2/V1, MNASNet, Single-Path NAS, FBNet
  • Deep Neural Networks with Box Convolutions
  • Invertible Residual Networks
  • Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks
  • Faster Faster R-CNN Implementation
    • Faster R-CNN Another Implementation
  • Paying More Attention to Attention: Improving the Performance of Convolutional Neural Networks via Attention Transfer
  • Wide ResNet model in PyTorch
  • DiracNets: Training Very Deep Neural Networks Without Skip-Connections
  • An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition
  • Efficient Densenet
  • Video Frame Interpolation via Adaptive Separable Convolution
  • Learning local feature descriptors with triplets and shallow convolutional neural networks
  • Densely Connected Convolutional Networks
  • Very Deep Convolutional Networks for Large-Scale Image Recognition
  • SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size
  • Deep Residual Learning for Image Recognition
  • Training Wide ResNets for CIFAR-10 and CIFAR-100 in PyTorch
  • Deformable Convolutional Network
  • Convolutional Neural Fabrics
  • Deformable Convolutional Networks in PyTorch
  • Dilated ResNet combination with Dilated Convolutions
  • Striving for Simplicity: The All Convolutional Net
  • Convolutional LSTM Network
  • Big collection of pretrained classification models
  • PyTorch Image Classification with Kaggle Dogs vs Cats Dataset
  • CIFAR-10 on Pytorch with VGG, ResNet and DenseNet
  • Base pretrained models and datasets in pytorch (MNIST, SVHN, CIFAR10, CIFAR100, STL10, AlexNet, VGG16, VGG19, ResNet, Inception, SqueezeNet)
  • NVIDIA/unsupervised-video-interpolation

Segmentation

  • Detectron2 by FAIR
  • Pixel-wise Segmentation on VOC2012 Dataset using PyTorch
  • Pywick - High-level batteries-included neural network training library for Pytorch
  • Improving Semantic Segmentation via Video Propagation and Label Relaxation
  • Super-BPD: Super Boundary-to-Pixel Direction for Fast Image Segmentation
  • Catalyst.Segmentation
  • Segmentation models with pretrained backbones

Geometric Deep Learning: Graph & Irregular Structures

  • PyTorch Geometric, Deep Learning Extension
  • PyTorch Geometric Temporal: A Temporal Extension Library for PyTorch Geometric
  • PyTorch Geometric Signed Directed: A Signed & Directed Extension Library for PyTorch Geometric
  • ChemicalX: A PyTorch Based Deep Learning Library for Drug Pair Scoring
  • Self-Attention Graph Pooling
  • Position-aware Graph Neural Networks
  • Signed Graph Convolutional Neural Network
  • Graph U-Nets
  • Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks
  • MixHop: Higher-Order Graph Convolutional Architectures via Sparsified Neighborhood Mixing
  • Semi-Supervised Graph Classification: A Hierarchical Graph Perspective
  • PyTorch BigGraph by FAIR for Generating Embeddings From Large-scale Graph Data
  • Capsule Graph Neural Network
  • Splitter: Learning Node Representations that Capture Multiple Social Contexts
  • A Higher-Order Graph Convolutional Layer
  • Predict then Propagate: Graph Neural Networks meet Personalized PageRank
  • Lorentz Embeddings: Learn Continuous Hierarchies in Hyperbolic Space
  • Graph Wavelet Neural Network
  • Watch Your Step: Learning Node Embeddings via Graph Attention
  • Signed Graph Convolutional Network
  • Graph Classification Using Structural Attention
  • SimGNN: A Neural Network Approach to Fast Graph Similarity Computation
  • SINE: Scalable Incomplete Network Embedding
  • HypER: Hypernetwork Knowledge Graph Embeddings
  • TuckER: Tensor Factorization for Knowledge Graph Completion
  • PyKEEN: A Python library for learning and evaluating knowledge graph embeddings
  • Pathfinder Discovery Networks for Neural Message Passing
  • SSSNET: Semi-Supervised Signed Network Clustering
  • MagNet: A Neural Network for Directed Graphs
  • PyTorch Geopooling: Geospatial Pooling Modules for Neural Networks in PyTorch

Sorting

  • Stochastic Optimization of Sorting Networks via Continuous Relaxations

Ordinary Differential Equations Networks

  • Latent ODEs for Irregularly-Sampled Time Series
  • GRU-ODE-Bayes: continuous modelling of sporadically-observed time series

Multi-task Learning

  • Hierarchical Multi-Task Learning Model
  • Task-based End-to-end Model Learning
  • torchMTL: A lightweight module for Multi-Task Learning in pytorch

GANs, VAEs, and AEs

  • BigGAN: Large Scale GAN Training for High Fidelity Natural Image Synthesis
  • High Fidelity Performance Metrics for Generative Models in PyTorch
  • Mimicry, PyTorch Library for Reproducibility of GAN Research
  • Clean Readable CycleGAN
  • StarGAN
  • Block Neural Autoregressive Flow
  • High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs
  • A Style-Based Generator Architecture for Generative Adversarial Networks
  • GANDissect, PyTorch Tool for Visualizing Neurons in GANs
  • Learning deep representations by mutual information estimation and maximization
  • Variational Laplace Autoencoders
  • VeGANS, library for easily training GANs
  • Progressive Growing of GANs for Improved Quality, Stability, and Variation
  • Conditional GAN
  • Wasserstein GAN
  • Adversarial Generator-Encoder Network
  • Image-to-Image Translation with Conditional Adversarial Networks
  • Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks
  • On the Effects of Batch and Weight Normalization in Generative Adversarial Networks
  • Improved Training of Wasserstein GANs
  • Collection of Generative Models with PyTorch
    • Generative Adversarial Nets (GAN)
      1. Vanilla GAN
      2. Conditional GAN
      3. InfoGAN
      4. Wasserstein GAN
      5. Mode Regularized GAN
    • Variational Autoencoder (VAE)
      1. Vanilla VAE
      2. Conditional VAE
      3. Denoising VAE
      4. Adversarial Autoencoder
      5. Adversarial Variational Bayes
  • Improved Training of Wasserstein GANs
  • CycleGAN and Semi-Supervised GAN
  • Improving Variational Auto-Encoders using Householder Flow and using convex combination linear Inverse Autoregressive Flow
  • PyTorch GAN Collection
  • Generative Adversarial Networks, focusing on anime face drawing
  • Simple Generative Adversarial Networks
  • Adversarial Auto-encoders
  • torchgan: Framework for modelling Generative Adversarial Networks in Pytorch
  • Evaluating Lossy Compression Rates of Deep Generative Models
  • Catalyst.GAN
    1. Vanilla GAN
    2. Conditional GAN
    3. Wasserstein GAN
    4. Improved Training of Wasserstein GANs

Unsupervised Learning

  • Unsupervised Embedding Learning via Invariant and Spreading Instance Feature
  • AND: Anchor Neighbourhood Discovery

Adversarial Attacks

  • Deep Neural Networks are Easily Fooled: High Confidence Predictions for Unrecognizable Images
  • Explaining and Harnessing Adversarial Examples
  • AdverTorch - A Toolbox for Adversarial Robustness Research

Style Transfer

  • Pystiche: Framework for Neural Style Transfer
  • Detecting Adversarial Examples via Neural Fingerprinting
  • A Neural Algorithm of Artistic Style
  • Multi-style Generative Network for Real-time Transfer
  • DeOldify, Coloring Old Images
  • Neural Style Transfer
  • Fast Neural Style Transfer
  • Draw like Bob Ross

Image Captioning

  • CLIP (Contrastive Language-Image Pre-Training)
  • Neuraltalk 2, Image Captioning Model, in PyTorch
  • Generate captions from an image with PyTorch
  • DenseCap: Fully Convolutional Localization Networks for Dense Captioning

Transformers

  • Attention is all you need
  • Spatial Transformer Networks

Similarity Networks and Functions

  • Conditional Similarity Networks

Reasoning

  • Inferring and Executing Programs for Visual Reasoning

General NLP

  • nanoGPT, fastest repository for training/finetuning medium-sized GPTs
  • minGPT, Re-implementation of GPT to be small, clean, interpretable and educational
  • Espresso, Module Neural Automatic Speech Recognition Toolkit
  • Label-aware Document Representation via Hybrid Attention for Extreme Multi-Label Text Classification
  • XLNet
  • Conversing by Reading: Contentful Neural Conversation with On-demand Machine Reading
  • Cross-lingual Language Model Pretraining
  • Libre Office Translate via PyTorch NMT
  • BERT
  • VSE++: Improved Visual-Semantic Embeddings
  • A Structured Self-Attentive Sentence Embedding
  • Neural Sequence labeling model
  • Skip-Thought Vectors
  • Complete Suite for Training Seq2Seq Models in PyTorch
  • MUSE: Multilingual Unsupervised and Supervised Embeddings
  • TorchMoji: PyTorch Implementation of DeepMoji to Understand Language Used to Express Emotions

Question and Answering

  • Visual Question Answering in Pytorch
  • Reading Wikipedia to Answer Open-Domain Questions
  • Deal or No Deal? End-to-End Learning for Negotiation Dialogues
  • Interpretable Counting for Visual Question Answering
  • Open Source Chatbot with PyTorch

Speech Generation and Recognition

  • PyTorch-Kaldi Speech Recognition Toolkit
  • WaveGlow: A Flow-based Generative Network for Speech Synthesis
  • OpenNMT
  • Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
  • WeNet: Production First and Production Ready End-to-End Speech Recognition Toolkit

Document and Text Classification

  • Hierarchical Attention Network for Document Classification
  • Hierarchical Attention Networks for Document Classification
  • CNN Based Text Classification

Text Generation

  • Pytorch Poetry Generation

Text to Image

  • Stable Diffusion
  • Dall-E 2
  • Dall-E

Translation

  • Open-source (MIT) Neural Machine Translation (NMT) System

Sentiment Analysis

  • Recurrent Neural Networks for Sentiment Analysis (Aspect-Based) on SemEval 2014
  • Seq2Seq Intent Parsing
  • Finetuning BERT for Sentiment Analysis

Deep Reinforcement Learning

  • Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels
  • Exploration by Random Network Distillation
  • EGG: Emergence of lanGuage in Games, quickly implement multi-agent games with discrete channel communication
  • Temporal Difference VAE
  • High-performance Atari A3C Agent in 180 Lines PyTorch
  • Learning when to communicate at scale in multiagent cooperative and competitive tasks
  • Actor-Attention-Critic for Multi-Agent Reinforcement Learning
  • PPO in PyTorch C++
  • Reinforcement Learning for Bandit Neural Machine Translation with Simulated Human Feedback
  • Asynchronous Methods for Deep Reinforcement Learning
  • Continuous Deep Q-Learning with Model-based Acceleration
  • Asynchronous Methods for Deep Reinforcement Learning for Atari 2600
  • Trust Region Policy Optimization
  • Neural Combinatorial Optimization with Reinforcement Learning
  • Noisy Networks for Exploration
  • Distributed Proximal Policy Optimization
  • Reinforcement learning models in ViZDoom environment with PyTorch
  • Reinforcement learning models using Gym and Pytorch
  • SLM-Lab: Modular Deep Reinforcement Learning framework in PyTorch
  • Catalyst.RL

Deep Bayesian Learning and Probabilistic Programming

  • BatchBALD: Efficient and Diverse Batch Acquisition for Deep Bayesian Active Learning
  • Subspace Inference for Bayesian Deep Learning
  • Bayesian Deep Learning with Variational Inference Package
  • Probabilistic Programming and Statistical Inference in PyTorch
  • Bayesian CNN with Variational Inference in PyTorch

Spiking Neural Networks

  • Norse, Library for Deep Learning with Spiking Neural Networks

Anomaly Detection

  • Detection of Accounting Anomalies using Deep Autoencoder Neural Networks

Regression Types

  • Quantile Regression DQN

Time Series

  • Dual Self-Attention Network for Multivariate Time Series Forecasting
  • DILATE: DIstortion Loss with shApe and tImE
  • Variational Recurrent Autoencoder for Timeseries Clustering
  • Spatio-Temporal Neural Networks for Space-Time Series Modeling and Relations Discovery
  • Flow Forecast: A deep learning framework for time series forecasting built in PyTorch

Synthetic Datasets

  • Meta-Sim: Learning to Generate Synthetic Datasets

Neural Network General Improvements

  • The Artificial Dendrite Network Library for PyTorch
  • In-Place Activated BatchNorm for Memory-Optimized Training of DNNs
  • Train longer, generalize better: closing the generalization gap in large batch training of neural networks
  • FreezeOut: Accelerate Training by Progressively Freezing Layers
  • Binary Stochastic Neurons
  • Compact Bilinear Pooling
  • Mixed Precision Training in PyTorch

DNN Applications in Chemistry and Physics

  • Wave Physics as an Analog Recurrent Neural Network
  • Neural Message Passing for Quantum Chemistry
  • Automatic chemical design using a data-driven continuous representation of molecules
  • Deep Learning for Physical Processes: Integrating Prior Scientific Knowledge
  • Differentiable Molecular Simulation for Learning and Control

New Thinking on General Neural Network Architecture

  • Complement Objective Training
  • Decoupled Neural Interfaces using Synthetic Gradients

Linear Algebra

  • Eigenvectors from Eigenvalues

API Abstraction

  • Torch Layers, Shape inference for PyTorch, SOTA Layers
  • Hummingbird, run trained scikit-learn models on GPU with PyTorch

Low Level Utilities

  • TorchSharp, .NET API with access to underlying library powering PyTorch

PyTorch Utilities

  • Functorch: prototype of JAX-like composable Function transformers for PyTorch
  • Poutyne: Simplified Framework for Training Neural Networks
  • PyTorch Metric Learning
  • Kornia: an Open Source Differentiable Computer Vision Library for PyTorch
  • BackPACK to easily Extract Variance, Diagonal of Gauss-Newton, and KFAC
  • PyHessian for Computing Hessian Eigenvalues, trace of matrix, and ESD
  • Hessian in PyTorch
  • Differentiable Convex Layers
  • Albumentations: Fast Image Augmentation Library
  • Higher, obtain higher order gradients over losses spanning training loops
  • Neural Pipeline, Training Pipeline for PyTorch
  • Layer-by-layer PyTorch Model Profiler for Checking Model Time Consumption
  • Sparse Distributions
  • Diffdist, Adds Support for Differentiable Communication allowing distributed model parallelism
  • HessianFlow, Library for Hessian Based Algorithms
  • Texar, PyTorch Toolkit for Text Generation
  • PyTorch FLOPs counter
  • PyTorch Inference on C++ in Windows
  • EuclidesDB, Multi-Model Machine Learning Feature Database
  • Data Augmentation and Sampling for PyTorch
  • PyText, deep learning based NLP modelling framework officially maintained by FAIR
  • Torchstat for Statistics on PyTorch Models
  • Load Audio files directly into PyTorch Tensors
  • Weight Initializations
  • Spatial transformer implemented in PyTorch
  • PyTorch AWS AMI, run PyTorch with GPU support in less than 5 minutes
  • Use TensorBoard with PyTorch
  • Simple Fit Module in PyTorch, similar to Keras
  • torchbearer: A model fitting library for PyTorch
  • PyTorch to Keras model converter
  • Gluon to PyTorch model converter with code generation
  • Catalyst: High-level utils for PyTorch DL & RL research
  • PyTorch Lightning: Scalable and lightweight deep learning research framework
  • Determined: Scalable deep learning platform with PyTorch support
  • PyTorch-Ignite: High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently
  • torchvision: A package consisting of popular datasets, model architectures, and common image transformations for computer vision.
  • Poutyne: A Keras-like framework for PyTorch that handles much of the boilerplate code needed to train neural networks.
  • torchensemble: Scikit-Learn like ensemble methods in PyTorch
  • TorchFix - a linter for PyTorch-using code with autofix support
  • pytorch360convert - Differentiable image conversions between 360° equirectangular images, cubemaps, and perspective projections

PyTorch Video Tutorials

  • PyTorch Zero to All Lectures
  • PyTorch For Deep Learning Full Course
  • PyTorch Lightning 101 with Alfredo Canziani and William Falcon
  • Practical Deep Learning with PyTorch

Community

  • PyTorch Discussion Forum
  • StackOverflow PyTorch Tags
  • Catalyst.Slack

To be Classified

  • Perturbative Neural Networks
  • Accurate Neural Network Potential
  • Scaling the Scattering Transform: Deep Hybrid Networks
  • CortexNet: a Generic Network Family for Robust Visual Temporal Representations
  • Oriented Response Networks
  • Associative Compression Networks
  • Clarinet
  • Continuous Wavelet Transforms
  • mixup: Beyond Empirical Risk Minimization
  • Network In Network
  • Highway Networks
  • Hybrid computing using a neural network with dynamic external memory
  • Value Iteration Networks
  • Differentiable Neural Computer
  • A Neural Representation of Sketch Drawings
  • Understanding Deep Image Representations by Inverting Them
  • NIMA: Neural Image Assessment
  • NASNet-A-Mobile. Ported weights
  • Graphics code generating model using Processing
  • Github Repository
  • Website

Contributions

Do feel free to contribute!

You can raise an issue or submit a pull request, whichever is more convenient for you. The guideline is simple: just follow the format of the previous bullet point or create a new section if it’s a new category.

New Special Dedicated List to AI Agents | The Incredible AI Agents

Visit The Incredible AI Agents, a curated list of resources on building, evaluating, deploying, and monitoring AI agents. Feel free to view, star, share, and/or contribute!
