o1

Here are 38 public repositories matching this topic...

melih-unsal / DemoGPT

🤖 Everything you need to create an LLM Agent—tools, prompts, frameworks, and models—all in one place.

Updated May 9, 2025
Python

heshengtao / comfyui_LLM_party

LLM agent framework in ComfyUI. Includes an MCP server, Omost, GPT-sovits, ChatTTS, GOT-OCR2.0, and FLUX prompt nodes; provides access to Feishu and Discord; and adapts to all LLMs with OpenAI- or aisuite-like interfaces, such as o1, ollama, gemini, grok, qwen, GLM, deepseek, kimi, and doubao. Also adapted to local LLMs, VLMs, and GGUF models such as llama-3.3 and Janus-Pro, with linkage to graphRAG.

Updated May 10, 2025
Python

szczyglis-dev / py-gpt

Desktop AI Assistant powered by o1, o3, GPT-4, GPT-4 Vision, Gemini, Claude, Llama 3, DeepSeek, Bielik, DALL-E, chat, vision, voice control, image generation and analysis, agents, command execution, file upload/download, speech synthesis and recognition, access to Web, memory, presets, assistants, plugins, and more. Linux, Windows, Mac

Updated Mar 6, 2025
Python

zzli2022 / Awesome-System2-Reasoning-LLM

Latest Advances on System-2 Reasoning

benchmark mcts rl reasoning r1 prm o3 o1 slow-fast system-2 self-improve macro-action

Updated Apr 23, 2025
Python

sunnynexus / Search-o1

Search-o1: Agentic Search-Enhanced Large Reasoning Models

math livecode amc reasoning r1 rag qwq aimo o1 gpqa

Updated May 4, 2025
Python

RUC-NLPIR / WebThinker

🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability

gaia reasoning hle reportgen o3 qwq webwalker o1 deepsearch deepseek-r1 gpqa deepresearch

Updated May 10, 2025
Python

modelscope / awesome-deep-reasoning

Collect every awesome work about r1!

collection rl reasoning r1 o1 qwen deepseek grpo

Updated May 2, 2025
Python

tcsenpai / multi1

multi1: create o1-like reasoning chains with multiple AI providers (and locally). Supports LiteLLM as backend too for 100+ providers at once.

api ai local remote openai chains reasoning perplexity groq o1 llm ollama litellm

Updated Jan 29, 2025
Python

RyanLiu112 / compute-optimal-tts

Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".

r1 o1 large-language-model process-reward-model test-time-scaling

Updated Feb 19, 2025
Python

pseudotensor / open-strawberry

Building an open version of OpenAI o1 via reasoning traces (Groq, ollama, Anthropic, Gemini, OpenAI, and Azure supported). Demo: https://huggingface.co/spaces/pseudotensor/open-strawberry

openai reasoning groq o1 chain-of-thought anthropic llama3

Updated Oct 15, 2024
Python

InternLM / OREAL

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

mathematics rl reasoning o1 llm

Updated Mar 20, 2025
Python

AdieLaine / multi-agent-reasoning

The Multi-Agent Reasoning framework creates an interactive chatbot where AI agents collaborate via structured reasoning and Swarm Integration for optimal answers. Simulating a team that discusses, debates, and refines responses, it enables complex problem-solving and precise results. Now with Prompt Caching to reduce latency and costs.

python chatbot multi-agent openai swarm agent-based-modeling reasoning o1 prompt-caching

Updated Jan 23, 2025
Python

CJReinforce / PURE

Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"

reinforcement-learning mathematics rl reasoning r1 o1 llm reinforcement-finetuning

Updated May 6, 2025
Python

RyanLiu112 / GenPRM

Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".

r1 o1 large-language-model process-reward-model test-time-scaling

Updated Apr 24, 2025
Python

ritzz-ai / GUI-R1

Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents

deep-reinforcement-learning r1 multimodal o1 multimodal-large-language-models large-multimodal-models gui-agent grpo mllm-reasoning

Updated May 5, 2025
Python

UCSC-VLAA / o1_medical

benchmark o1 medical-llm

Updated Feb 26, 2025
Python

The-Swarm-Corporation / AgentGym

A framework that makes it effortless to convert any LLM into a reasoning agent like o1 or DeepSeek's r1

ai rl agents alibaba r1 o1 llms qwen deepseek

Updated Apr 21, 2025
Python

0xrushi / Terminal-Voice-Assistant

Terminal Voice Assistant is a powerful and flexible tool designed to help users interact with their terminal using natural language commands.

terminal voice assistant o1 openinterpreter

Updated Jun 9, 2024
Python

sylvain-wei / 24-Game-Reasoning

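A very simple reproduction of DeepSeek-R1-Zero and DeepSeek-R1, using the 24 game as an example. It applies zero-RL, SFT, and SFT+RL to elicit the LLM's ability to verify and reflect on its own reasoning. A clean, minimal, accessible reproduction of DeepSeek R1-Zero and DeepSeek R1.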

alignment reasoning r1 post-training cot sft o1 24game llm rlhf deepseek r1-zero verl long-cot

Updated Apr 5, 2025
Python

Ruiyang-061X / Uncertainty-o

✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Models".

uncertainty reasoning multimodal model-agnostic o1 large-language-models chain-of-thought large-multimodal-models hallucination-detection hallucination-mitigation

Updated Mar 13, 2025
Python
