Transformers Projects .

Technology

Transformers

The deep learning architecture that revolutionized sequence modeling (NLP, vision) by replacing recurrent units with a parallelizable multi-head self-attention mechanism.

The Transformer: a neural network architecture introduced in the landmark 2017 paper, "Attention Is All You Need." It eliminated the sequential processing bottleneck of prior Recurrent Neural Networks (RNNs) by relying solely on self-attention, enabling massive parallelization and significantly faster training (up to 10x faster) on modern hardware. This efficiency allowed for the creation of large-scale pre-trained models: BERT (encoder-only) and the generative GPT series (decoder-only). The architecture is now foundational to all modern Large Language Models (LLMs) and drives the current state-of-the-art in AI.

https://doi.org/10.48550/arXiv.1706.03762
146 projects · 51 cities

Related technologies

Recent Talks & Demos

Showing 1-24 of 146

Members-Only

Sign in to see who built these projects

Optimización de recursos para LLMs
Bogotá
Transformers PEFT
Nanochat: Train LLMs from Scratch
Brussels Apr 1
Python Torch
Words to World: AI Models
San Diego Feb 26
Unreal Engine 5 PyTorch
Hugging Face RAG: Reduce Hallucinations
Tiruchirappalli Jan 31
Transformers RAG
Transformers Detect Netflow Anomalies
Toronto Jan 29
Python Transformers
Biological Age from Blood Work
Seattle Dec 18
GPT-4 OpenAI API
x402-Enabled AI Gateway
Atlanta Dec 16
GPT-4 LangChain
SLM Fine-tuning on 16GB CPU
Waterloo Dec 15
LangChain Transformers
AI-First Clinical Trials EDC
Chicago Dec 9
React Spring Boot
fastworkflow: SOTA with Small Models
Houston Dec 9
GPT-4 Claude-3
Science of Intelligence
Portland Dec 3
GPT-4 LangChain
NotebookLM: Grounded Academic Research
Asuncion Nov 27
GPT-4 LangChain
AI Management: Robotics Safety Standards
Hong Kong Nov 27
GPT-4 LangChain
Paradigm: Understand Legacy Code
Poland Nov 26
GPT-4 LangChain
Constrained Decoding: LLM Pixel Art
Montreal Nov 20
Modal Transformers
GPT-5 Ad Campaign Simulator
Boston Nov 17
GPT-4 LangChain
Finetuning SLMs for Agents
Amsterdam Nov 11
Distill Labs Transformers
Instruct Lab LLM Evaluation Playbook
Toronto Nov 10
Merlinite-7B-Lab Mistral Mixtral
Tracking AI code
New York City Nov 6
GPT-4 Llama-2
WikiMem
Minneapolis Saint Paul Nov 5
GPT-4 LangChain
Secure AI Health Assistant with EHR
Dhaka Nov 1
OpenAI API FastAPI
Optimizing Agent Latency with Evals
San Francisco Oct 30
GPT-4 LangChain
RapidFire AI: Parallel LLM Experimentation
San Diego Oct 29
PyTorch Transformers
IA Aplicada: Embeddings y Validación GPT
Santiago Oct 29
GPT-4 LangChain