OpenAI 模型破解 80 年數學猜想：科研自動化時代 indie hacker 點上車

數學界仲未回過神來。Five days ago, OpenAI 公佈咗一個內部 reasoning model 破解咗 Paul Erdős 喺 1946 年提出嘅平面單位距離問題（unit distance problem），而且係用代數數論入面嘅 class field towers 同 Golod–Shafarevich theory 呢類深水炸彈式工具——唔係砌個更大嘅 grid、唔係 brute force search，係由零推出一套全新嘅數學 construction，推翻咗學界堅信咗八十年嘅猜想。

Tim Gowers 話呢個係「AI 數學嘅里程碑」，Arul Shankar 話個模型「唔止係助手，而係有能力產生原創 ideas 然後執行到底」。但作為 indie hacker，我哋最關心嘅問題唔係純數學——而係：一個 general-purpose reasoning model 可以自主解開未解決嘅數學問題，咁我哋可以點用 API 將呢種能力轉化成 product？

唔好睇錯重點：關鍵係「general-purpose reasoning」

好多報導聚焦喺「AI 破解數學猜想」呢個 headline，但真正值得 indie hacker 留意嘅係呢句出自 OpenAI 官方文章：「The proof came from a new general-purpose reasoning model, rather than from a system trained specifically for mathematics, scaffolded to search through proof strategies, or targeted at the unit distance problem in particular。」

即係話，呢個模型冇特別 fine-tune 過數學、冇畀佢一個數學專用嘅 search tree、冇針對「單位距離問題」做任何定制。佢就係一個 pure reasoning model，用 test-time compute 去思考、去推論、去嘗試證明或反證。OpenAI 搵咗一批 Erdős 嘅遺留問題畀佢試，佢就揀咗其中一個嚟解決。

呢個 pattern 對我哋做 product 嘅人嚟講極之重要。如果你要用 AI 做科研自動化，你唔使訓練一個「化學專用模型」或者「生物學專用模型」。你只需要一個夠強嘅 reasoning model，然後畀佢 access 到你 domain 嘅工具和 data——佢自然會 connect the dots。呢個係 platform play，唔係 vertical play。Indie hacker 嘅優勢就喺呢度：你唔使同大公司鬥 train model，你只需要用佢哋嘅 API 去 build 上層 application。

科研自動化嘅新 workflow：從 heuristic 到 reasoning

過去一年，科研 automation 嘅主流做法係「tool-calling agent」——叫做一個 LLM，畀佢 access 文獻搜尋、code execution、數據庫，然後佢會幫你做實驗或分析。呢種做法 work，但 limitation 好明顯：LLM 好多時只係做 retrieval and summarization，真正嘅「科研洞察」仍然要靠人。

Reasoning model 改變咗呢個局面。睇返 OpenAI 個 case，佢哋披露咗個 model 嘅 chain of thought，發現「a significant majority of the thoughts are trying to construct a counterexample to the widely believed upper bound, rather than trying to prove it」。換句話說，個 model 冇盲從主流 belief，而係自己判断咗「呢個猜想可能係錯」，然後主動去搵 counterexample。

呢種獨立判斷能力，先係 game-changer。以前你叫 AI 做科研，佢係個「聰明的助手」——幫你 summarize paper、寫 code、做可視化。但佢唔會 challenge 主流觀點，唔會話「你覺得呢個 hypothesis 係真，但我覺得係假」。而家 reasoning model 會。

對於 indie hacker 嚟講，呢個意味住你可以開始 build 一啲「AI 研究員」product，而唔止係「AI 研究助理」。分別在於：研究助理幫你慳 time，研究員幫你發現 knowledge gap。

三個 indie hacker 可以即刻上車嘅方向

第一個方向：文獻級嘅 hypothesis generation tool。Reasoning model 嘅長處係可以將唔同領域嘅知識串連埋一齊。你可以俾佢讀一個領域嘅 50 篇論文，然後叫佢 propose 幾個「如果將 field A 嘅方法用喺 field B 嘅問題，會唔會有突破？」嘅 hypothesis。呢種 cross-domain reasoning 以前只可以靠 senior researcher 嘅直覺，而家可以 scale。

第二個方向：自動化嘅 disproval pipeline。OpenAI 個 model 成功嘅原因之一係佢選擇咗「試圖 disprove 而非 prove」。好多人類研究員有 confirmation bias——傾向搵證據支持自己嘅 theory。Reasoning model 冇呢個 bias。你可以 build 一個 tool，讓 researcher 提交一個 conjecture，然後 AI 自動嘗試搵 counterexample。如果搵唔到，confidence 提升；如果搵到，慳返幾個月嘅 wasted effort。

第三個方向：數學/科學嘅「思路探索」平台。Reasoning model 嘅 chain of thought 本身就係有價值嘅 output。OpenAI 公開咗部分 CoT，入面紀錄咗個模型點樣逐步思考、點樣撞牆、點樣 pivoting。呢啲嘢對於教學和科研都非常有價值。你可以 build 一個 platform，讓用戶提交問題，然後 AI 輸出完整嘅 reasoning trace——唔止係答案，係思路地圖。

唔好等 infrastructure 成熟，而家就出手

有人會話：「呢個 model 係 OpenAI 内部用嘅，未公開做 API。」呢句係事實，但都只係暫時性。GPT-5.5 同之前嘅 reasoning models 已經有 API 可以 call，capabilities 每個月都在升。等「perfect model」先出手永遠係輸家策略。

更重要嘅係，你而家可以開始做幾件事：第一，熟悉 reasoning API 嘅 prompting technique——點樣引導模型做 multi-step reasoning、點樣畀佢 access external tools、點樣 parse 佢嘅 CoT。第二，揀一個 niche 嘅科研 domain，唔好同大公司爭 general stuff。小眾領域（例如特定材料科學問題、特定生物 pathway）嘅數據規模細、domain knowledge 門檻高，反而係 indie hacker 嘅 sweet spot。第三，開始試 prototype。OpenAI 公佈咗呢個結果之後，數學界同科研 funding 機構會大量投入資源去探索 reasoning model 嘅應用。2026 下半年會係科研 automation startup 嘅黃金窗口。

Erdős 嘅問題等咗 80 年先有人（或者 AI）破解。但下一波突破唔會再等咁耐。問題係：你嘅 product 會唔會係其中一個答案？