Hacker News | kcorbitt's submissions
1. Show HN: RULER – Easily apply RL to any agent (openpipe.ai)
81 points by kcorbitt 89 days ago | past | 11 comments
2. Everything I know about reward hacking (openpipe.ai)
3 points by kcorbitt 3 months ago | past
3. Show HN: ART – a new open-source RL framework for training agents (github.com/openpipe)
116 points by kcorbitt 5 months ago | past | 12 comments
4. ART·E: how we built an email research agent that beats o3 (openpipe.ai)
3 points by kcorbitt 5 months ago | past | 2 comments
5. Using GRPO to Beat o1, o3-mini and R1 at “Temporal Clue” (openpipe.ai)
199 points by kcorbitt 7 months ago | past | 55 comments
6. Analyzing OpenAI's Reinforcement Fine-Tuning: Less Data, Better Results (openpipe.ai)
4 points by kcorbitt 9 months ago | past
7. Using reinforcement learning and $4.80 of GPU time to find the best HN post (openpipe.ai)
217 points by kcorbitt 11 months ago | past | 95 comments
8. Show HN: Agent.exe, a cross-platform app to let 3.5 Sonnet control your machine (github.com/corbt)
406 points by kcorbitt 11 months ago | past | 232 comments
9. DPO fine-tuning outperforms SFT (openpipe.ai)
1 point by kcorbitt on Oct 2, 2024 | past
10. OpenPipe Mixture of Agents: Outperform GPT-4 at 1/25th the Cost (openpipe.ai)
13 points by kcorbitt on June 20, 2024 | past | 2 comments
11. What we've learned in 3 days of Llama 3 (openpipe.ai)
3 points by kcorbitt on April 22, 2024 | past
12. Mixtral Curious? Comparing Mistral 7B and Mixtral for fine-tuning (openpipe.ai)
1 point by kcorbitt on Feb 29, 2024 | past
13. S-LoRA: Serving Thousands of Models from One GPU for Fun and Profit (openpipe.ai)
1 point by kcorbitt on Jan 18, 2024 | past
14. Is AI the next crypto? Insights from HN comments (openpipe.ai)
237 points by kcorbitt on Nov 8, 2023 | past | 367 comments
15. Fine-tune your own Llama 2 to replace GPT-3.5/4
955 points by kcorbitt on Sept 12, 2023 | past | 181 comments
16. Show HN: Automatically convert your GPT-3.5 prompt to Llama 2
13 points by kcorbitt on Aug 9, 2023 | past | 2 comments
17. TaxyAI: Open-source browser automation with GPT-4 (github.com/taxyai)
355 points by kcorbitt on March 28, 2023 | past | 99 comments
18. Tell HN: YC will help you find a co-founder
433 points by kcorbitt on July 6, 2021 | past | 131 comments
19. YC Startup School for future founders who aren't quite ready to start yet (blog.ycombinator.com)
330 points by kcorbitt on Oct 30, 2020 | past | 87 comments
20. YC's Startup School Relaunching as Continuous Program (blog.ycombinator.com)
330 points by kcorbitt on June 17, 2020 | past | 58 comments
21. As Uber and Tesla struggle with driverless cars, Waymo moves forward (arstechnica.com)
76 points by kcorbitt on June 1, 2018 | past | 48 comments
22. McKinsey: One-third of US workers could be jobless by 2030 due to automation (cnbc.com)
5 points by kcorbitt on Nov 29, 2017 | past | 1 comment
23. Startup Ideas (blog.ycombinator.com)
614 points by kcorbitt on Nov 16, 2017 | past | 453 comments
24. Sam Altman presents a political vision for California and the U.S (techcrunch.com)
1 point by kcorbitt on July 13, 2017 | past
25. A ReasonReact Tutorial (jaredforsyth.com)
161 points by kcorbitt on July 6, 2017 | past | 28 comments
26. US Household Debt Surpasses 2008 High (nytimes.com)
250 points by kcorbitt on May 17, 2017 | past | 418 comments
27. New attack that cripples HTTPS crypto works on Macs, Windows, and Linux (arstechnica.com)
250 points by kcorbitt on July 26, 2016 | past | 68 comments
28. Evernote limits free tier to two devices, raises prices 40% (arstechnica.com)
13 points by kcorbitt on June 29, 2016 | past | 1 comment
29. Report: Apple is approving apps more quickly to increase Services revenue (arstechnica.com)
1 point by kcorbitt on May 12, 2016 | past
30. One Day with React Native for Android (corbt.com)
2 points by kcorbitt on Sept 16, 2015 | past
