OpenAIreleased a bunchofexamples, likethisguymaking a playablesnakegamein a singleshot, orthisguycreating a nonogrampuzzle.
Andthemodelcanevenreliablytellyouhowmany R's areinthewordstrawberry, a questionthathasbaffledLLMsinthepast.
Actually, justkidding, itfailedthattestwhen I triedtorunitmyself.
Andtheactualchainofthoughtishiddenfromtheenduser, eventhoughyoudohavetopayforthosetokensat a priceof $60 per 1 million.
However, theydoprovidesomeexamplesofchainofthought, likeinthiscodingexamplethattransposes a matrixinBash.
You'llnoticethatitfirstlooksattheshapeoftheinputsandoutputs, thenconsiderstheconstraintsoftheprogramminglanguage, andgoesthrough a bunchofotherstepsbeforeregurgitating a response.
Butthisisthefirsttime a modellikethishasbecomegenerallyavailabletothepublic.
Let's goaheadandfindoutifitslaps.
I rememberyearsagowhen I firstlearnedcode, I recreatedtheclassicMS-DOSgameDogWars, a turn-basedstrategygamewhereyouplaytheroleof a travelingsalesmanandhaverandomencounterswithOfficerHardass.
As a biologicalhuman, ittookmelike a hundredhourstobuild.
Butlet's firstseehowGPT-4-0 doeswithit.
When I askittobuildthisgamein C with a GUI, itproducescodethatalmostworks, but I wasn't abletogetittocompile, andafter a coupleoffollow-upprompts, I finallygotsomethingworking, butthegamelogicwasverylimited.
Nowlet's givethenew 0-1 thatexactsameprompt.
Whatyou'llnoticeisthatitgoesthroughthechainofthought, likeit's thinking, thenassessingcompliance, andsoon, butwhatit's actuallydoingunderthehoodiscreatingthosereasoningtokens, whichshouldleadto a morecomprehensiveandaccurateresult.
IncontrasttoGPT-4, 0-1 compiledrightaway, anditfollowedthegamerequirementsto a T.
Atfirstglance, itactuallyseemedlike a flawlessgame, butitturnsouttheappwasactuallyprettybuggy.
I keptgettingintothisinfiniteloopwithOfficerHardass, andtheUIwasalsoterrible.
I triedtofixtheseissueswithadditionalfollow-upprompts, buttheyactuallyledtomorehallucinationsandmorebugs, andit's prettyclearthatthismodelisn't trulyintelligent.
Thatbeingsaidthough, there's a hugeamountofpotentialwiththischainofthoughtapproach, andbypotential, I meanpotentialtooverstateitscapabilities.
In 2019, theyweretellingusGPT-2 wastoodangeroustorelease.