Tested Fugu-ultra on the questions that break most AI models:
- how many r's in strawberry
- is 5.11 bigger than 5.1
- the famous car/wash question (twice)
I think we just got Fable back
Ziwen on X: https://t.co/F2rcamh3rr