DeepSpeed

DeepSpeed is an open-source deep learning optimization library for PyTorch.[1] The library is designed to reduce the computing power and memory needed for training and to train large distributed models with better parallelism on existing computer hardware.[2][3] DeepSpeed is optimized for low-latency, high-throughput training. It includes the Zero Redundancy Optimizer (ZeRO) for training models with one trillion or more parameters.[4] Features include mixed-precision training; single-GPU, multi-GPU, and multi-node training; and custom model parallelism. The DeepSpeed source code is licensed under the MIT License and is available on GitHub.[5]
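To give a rough sense of why partitioning model state across GPUs makes trillion-parameter training plausible, the sketch below estimates per-GPU memory for model states under each ZeRO stage. It is an illustration only, not part of the DeepSpeed API: the function name `zero_memory_per_gpu_gb` is hypothetical, and the arithmetic assumes mixed-precision training with the Adam optimizer at 16 bytes per parameter (2 for fp16 weights, 2 for fp16 gradients, 12 for fp32 optimizer state), which is the accounting commonly used to describe ZeRO. Activations, temporary buffers, and fragmentation are ignored.

```python
def zero_memory_per_gpu_gb(num_params, num_gpus, stage):
    """Estimate per-GPU memory (GB, 1 GB = 1e9 bytes) for model states only.

    Assumption: mixed-precision Adam, i.e. 16 bytes per parameter in total.
    stage 0 = plain data parallelism (everything replicated),
    stage 1 = optimizer state partitioned,
    stage 2 = optimizer state and gradients partitioned,
    stage 3 = optimizer state, gradients, and weights partitioned.
    """
    if stage not in (0, 1, 2, 3):
        raise ValueError("stage must be 0, 1, 2, or 3")
    if num_gpus < 1:
        raise ValueError("num_gpus must be at least 1")

    weight_bytes = 2 * num_params      # fp16 weights
    gradient_bytes = 2 * num_params    # fp16 gradients
    optimizer_bytes = 12 * num_params  # fp32 weights, momentum, variance

    # Each stage splits one more component evenly across the GPUs.
    if stage >= 1:
        optimizer_bytes /= num_gpus
    if stage >= 2:
        gradient_bytes /= num_gpus
    if stage >= 3:
        weight_bytes /= num_gpus

    return (weight_bytes + gradient_bytes + optimizer_bytes) / 1e9


if __name__ == "__main__":
    # Illustrative sizes chosen for this sketch, not taken from the article.
    for stage in range(4):
        gb = zero_memory_per_gpu_gb(7_500_000_000, 64, stage)
        print(f"stage {stage}: {gb:.1f} GB per GPU")
```

Under these assumptions, a 7.5-billion-parameter model on 64 GPUs needs about 120 GB per GPU with everything replicated and under 2 GB per GPU with all three components partitioned. A one-trillion-parameter model spread over 1,024 GPUs comes to roughly 15.6 GB per GPU at stage 3.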

DeepSpeed

Original author(s)	Microsoft Research
Developer(s)	Microsoft
Initial release	May 18, 2020
Stable release	v0.5.10 / January 14, 2022
Repository	github.com/microsoft/DeepSpeed
Written in	Python, CUDA, C++
Type	Software library
License	MIT License
Website	deepspeed.ai

The DeepSpeed team claimed improvements of up to 6.2x higher throughput, 2.8x faster convergence, and 4.6x less communication.[6]

References

[1] "Microsoft Updates Windows, Azure Tools with an Eye on The Future". PCMag UK. May 22, 2020.
[2] Yegulalp, Serdar (February 10, 2020). "Microsoft speeds up PyTorch with DeepSpeed". InfoWorld.
[3] "Microsoft unveils "fifth most powerful" supercomputer in the world". Neowin.
[4] "Microsoft trains world's largest Transformer language model". February 10, 2020.
[5] "microsoft/DeepSpeed". July 10, 2020 – via GitHub.
[6] "DeepSpeed: Accelerating large-scale model inference and training via system optimizations and compression". Microsoft Research. May 24, 2021. Retrieved June 19, 2021.

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.
