zerojames

2025-06-27 12:34

Submitted: "LLM Speedrunner: Eval for frontier models to reproduce scientific findings"

1 points0 commentsgithub.com

The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in language modeling. - facebookresearch/llm-speedrunner

2025-06-26 4:07

Submitted: "Anthropic Desktop Extensions: One-click local MCP installation in desktop apps"

4 points2 commentsgithub.com

Desktop Extensions: One-click local MCP server installation in desktop apps - anthropics/dxt

2025-06-25 7:06

Submitted: "GenAI Processors: Modular, Asynchronous, and Composable AI Pipelines"

1 points0 commentsgithub.com

GenAI Processors is a lightweight Python library that enables efficient, parallel content processing. - google-gemini/genai-processors

2025-06-25 2:29

Submitted: "AlphaGenome"

4 points0 commentsgithub.com

This API provides programmatic access to the AlphaGenome model developed by Google DeepMind. - google-deepmind/alphagenome

2025-06-11 2:13

Submitted: "OpenAI o3-pro: Multimodal and Vision Analysis"

1 points0 commentsblog.roboflow.com

Explore how OpenAI o3-pro does on a range of use cases, from defect detection to object counting to VQA.

Hacker News

zerojames

3622

2022-01-05

About Me

Recent Activity

Submitted: "LLM Speedrunner: Eval for frontier models to reproduce scientific findings"

Submitted: "Anthropic Desktop Extensions: One-click local MCP installation in desktop apps"

Submitted: "GenAI Processors: Modular, Asynchronous, and Composable AI Pipelines"

Submitted: "AlphaGenome"

Submitted: "OpenAI o3-pro: Multimodal and Vision Analysis"

HackerNews