Transformers Projects

Transformers

The deep learning architecture that revolutionized sequence modeling (NLP, vision) by replacing recurrent units with a parallelizable multi-head self-attention mechanism.

The Transformer is a neural network architecture introduced in the 2017 paper "Attention Is All You Need." It removed the sequential processing bottleneck of earlier Recurrent Neural Networks (RNNs) by dispensing with recurrence and relying on self-attention, which lets every position in a sequence be processed in parallel during training and makes training substantially faster on modern hardware. That efficiency made large-scale pre-trained models practical, including BERT (encoder-only) and the generative GPT series (decoder-only). The architecture now underlies most modern Large Language Models (LLMs) and much of the current state of the art in AI.
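
As a rough illustration of the self-attention idea described above, here is a minimal single-head sketch in plain Python. It is a simplification, not the paper's full layer: it assumes queries, keys, and values are all the raw input vectors (no learned projections, no multiple heads, no masking), and the names `softmax`, `self_attention`, and `tokens` are made up for this example.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    peak = max(row)
    exps = [math.exp(value - peak) for value in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(x):
    """Scaled dot-product self-attention with Q = K = V = x.

    Assumption for illustration: no learned projection matrices and a
    single head. `x` is a list of equal-length token vectors.
    """
    d = len(x[0])
    scale = math.sqrt(d)
    # scores[i][j] = dot(x[i], x[j]) / sqrt(d): how strongly token i attends to token j.
    scores = [
        [sum(a * b for a, b in zip(query, key)) / scale for key in x]
        for query in x
    ]
    # Each row of scores is normalized independently into attention weights.
    weights = [softmax(row) for row in scores]
    # Each output vector is a weighted average of all token vectors.
    output = [
        [sum(w * value[col] for w, value in zip(weight_row, x)) for col in range(d)]
        for weight_row in weights
    ]
    return weights, output


# Three hypothetical 2-dimensional token vectors.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
weights, output = self_attention(tokens)
```

Each row of `weights` depends only on the inputs, not on the result for any other position, so all rows can be computed at once. An RNN, by contrast, has to finish step t before it can start step t+1.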

https://doi.org/10.48550/arXiv.1706.03762
146 projects · 51 cities

Recent Talks & Demos

OPEA: Production Multi-Agent Systems
Toronto Jun 18
OpenAI API Kubernetes
Browser Use: Automate Luma Sign-up
Toronto Jun 18
GPT-4 Google Gemini
Founder
Nerds Like Me Jun 17
GPT-4 LangChain
AiLightened Health
Cincinnati Jun 12
GPT-4 LangChain
Quantum Proof AI
Cincinnati Jun 12
GPT-4 LangChain
CaseRace
Cincinnati Jun 12
GPT-4 LangChain
End-to-End AI Pipelines
Nairobi Jun 11
GPT-4 LangChain
Transformers: Latent Space Attractors
Milan Jun 10
PyTorch Transformers
Wan 2.1 & ComfyUI Video Control
Los Angeles Jun 9
Stable Diffusion ComfyUI
MCP: LLM Architectural Design
Hong Kong Jun 6
GPT-4 LangChain
8-Bit Oracle: Building Process
Hong Kong Jun 6
GPT-4 LangChain
Holon: Voice Chat Agents
Hong Kong Jun 6
GPT-4 LangChain
AI Preserving East Asian Texts
Hong Kong Jun 6
GPT-4 LangChain
4x 3090 AI Rig Build
Berlin Jun 3
NVIDIA Transformers
Image-Aware Content Extraction
Berlin Jun 3
GPT-4 Transformers
LLM Reasoning Without Objective
Berlin Jun 3
GPT-4 LangChain
DSPy: Self-Programming Meta-Agents
New York City Jun 3
DSPy vLLM
ML for Government Transparency
New York City Jun 3
Llama-4 Gemini
Mnemosyne: Decentralized LLM Memory
Waterloo Jun 2
DeepSeek R1 Node.js
MCP: Automate Business with Markdown
Seattle May 30
Composio Cursor
MLOps PaaS: Mining AI Startup
Santiago May 29
MLOps OpenAI API
Poseido: Multi-Agent Windows GUI
Manizales May 28
GPT-4 LangChain
Laminar: Self-Healing AI Integrations
Toronto May 22
Claude-3 LangChain
TurboAPI: High-Performance AI Backends
Singapore May 21
Satya Starlette