I trained a Language Model to schedule events with GRPO!
Bamba-9B-v2 - Fast and powerful!
Introducing HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Detecting Hallucinations in Real-World Scenarios
Uncensor any LLM with abliteration
Creating your custom Ghibli Text-to-Image model
🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly! – Talking About It?
DeepWiki: Best AI Documentation Generator for Any Github Repo
Introduction to State Space Models (SSM)
Mixture of Tunable Experts - Behavior Modification of DeepSeek-R1 at Inference Time
Building Multimodal RAG Systems: Supercharging Retrieval with MultiModal Embeddings and LLMs
Code a simple RAG from scratch
PipelineRL
ColPali: Efficient Document Retrieval with Vision Language Models
Efficient LLM Pretraining: Packed Sequences and Masked Attention
What is test-time compute and how to scale it?
OpenManus: The Open Source Alternative to Manus AI
A Guide to Running Qwen 3 Locally with Ollama and vLLM
Agent2Agent and MCP: An End-to-End Tutorial for a complete Agentic Pipeline
Merge Large Language Models with mergekit
An Analysis of Chinese LLM Censorship and Bias with Qwen 2 Instruct