Okay, Sothefirstthingwewanttodoisdo a quickchecktoseeshouldweactuallytrain?
Sorecallwe'regonnahavethisreplaymemoryandthenfromthereplaymemory, whichshouldbequite a largememory, we'regonnagrab a minibatchthat's relativelysmall, butalsois a batchsizethatisofdecentsize.
Sotypicallywithneuralnetworksliketotraininbatchesof, like, 32 or 64 somethinglikethat.
Um, sowe'regonnadothesamethinghere, sowewant 32 or 64 tobeprettysmallcomparedtothesizeofourmemories.
Sointhiscase, I wanttosayourmaxmemorysizeis 50,000 um, andthentheleastamountthatwe'rewillingtotrainonits 10,000.
We'lltowupdatethatfor a neuralnetwork, likeinourtable.
Wejustupdateditright.
You'resomebodythatonevalue.
Butintheneuralnetwork, weoutputthesfourvalues.
Sowhatweenduphavingtodoisweupdatethis 9.5 to 8.5, likein a list.
Andthenwerefittheneuralnetworktoinsteadbe 3.28 point 57.21 pointthree.
Sothat's thenextthingthatweneedtodo.
Socomingdownherewhatwewanttosaystill, inourfourloop, we'regonnasaycurrentcuesequalsthecurrentcueslistattheindexthatwereataswe'reiteratingoverand I misssublimetext.
Uh, we'reiteratingoverYes, thestwovaluesindex.
Andthenyoushouldbeabletoguesswhat I meanttohavehere.
Andthatwasanenumerates, um, andalsoinwho?
Maybetoday's not a dayfortutorials.
Forme, she's onthenmanybatches.
Pridegoingoffofthescreenhere, seeif I canhelpouthere.
Therewego.
Sofourindex, currentstateactionrewardnewcurrentstatedonein a numeratorminibatch.
I guess I gotstucktherebecause I wasexplainingtheseandthenpointingoutthetransitions.
Anyway, nowit's a validloop.
Soifnotdone, anybodyknows.
IfAdamjusthas a builtinpet a kindofsyntaxchecker, letmeknow.
Postcountbelow.
Becausethisiswhatcomesonthepaperspacemachines.
And I justcan't beaskedtoalwaysbethrowingonsublimetextanyway.
Um, I didcopyandpastesomethings, and I knowsomeone's gonnabelike, youmadefunofpeopleforcopyingyoupaid.
What I meantbeforewasthatpeoplewerecopyingandpastingfromeachotherinthiscase, thethingsthatcopyandpaste, itwaslikethestuffthatwe'vealreadywritten, wherethethingsthatdon't matter.
So, um, feelfreetoletmeknowifyoulikethatordislikethatitwouldhavetakenprobablytwomorevideosforustorewritethedotheenvironmentthing.
Um, andtodowhatwastheotherthingwecopyandpaste.
We'llalsothetentsareboredblob m andwedidn't reallymade a coupleofchanges.
Thatwouldbeevenlikeallthosestupid l liftslikeComeon, you'rewelcome.
So, anyways, letmeknowifyoulikethatorjustlikethat, but I thinkthatmadethemostsense.
SelfdotmodeldotPredictSoftwas I didn't getcuesmodelPredict.
Yeah, itwasn't getcues.
Interesting.
Okay, soit's actuallytraining.
ThanktheLord.
Umyeah, interestingself.
ThaThat's sofunny.
I wonderhowthathowthatmadeitin.
Maybecodeuptothis, I guess, because I just I typeofitintheprevioustutorialandthen I copiedittohear I'm surprisednobodypickedthatupintheprevioustutorialtobehonestthatthatthatone's alreadybeenreleasedforthecurrentchannelmembersabout 100 people.
So I'm surprisednobodynoticedthatonewaslike, What's modelpredict?
Umand I thinkthatshouldbeeverythingthatyouwouldneedtoknowasfaraschangingtoefromwhat I justcoveredinthetutorialanyway, sowecouldloadmodelaggregateeveryshowpreview.