Today I want to draw your attention to the following Kaggle challenge, which is obviously highly relevant right now.
I also find it much closer to the kinds of tasks that businesses actually need, at least in my experience: when people come to me for contracting and consulting, the data tends to look more like this than like your average Kaggle competition.
Now, nothing against Kaggle competitions.
There's definitely true competition there.
But generally Kaggle comes down to a competition of optimization rather than a competition of finding insights in highly unstructured data.
I thought this would be a really good one to do a video on: I have just barely peeked at the data, and then we're going to dive in with code, so you will see firsthand the process that I, at least, begin with when looking through a dataset.
With this kind of dataset, though, I mean, you will spend hundreds of hours going through it, trying to get insight.
You might think, or even crazier... I've seen quite a few things.
I've seen some stuff.
Okay, so we've got things organized to an extent, and I guess the next thing I would do is organize it into... well, I guess it depends on what our goal is.
So now, coming back here and going to tasks, I'm looking through this list, and I thought about it a little bit initially, but I think the thing with extracting meaning from text is that you want to start with the lowest-hanging fruit first.
So one term that I think is highly unlikely to change, at least in scholarly journals about this, is incubation.
The term incubation.
The tag in my shirt keeps flipping up and it's driving me nuts.
The term incubation is likely to always be called incubation in a scholarly journal or text or whatever in research.
So my intuition, or expectation, is that we can use incubation as a starting point, because we can search the document for the word incubation.
Then, hopefully very close to where the word incubation is used, we can search for a duration, right, so numbers.
So my expectation, again, is that we could probably do a really basic regular expression, and then later we could really ramp up that regular expression to find many more examples and then hopefully filter out any mistakes.
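To make the "really basic regular expression" idea concrete, here is a minimal sketch. The pattern, the helper name durations_near_incubation, and the sample sentence are all assumptions for illustration, not what was typed in the video.

```python
import re

# Hypothetical starting pattern: a number (optionally with a decimal part)
# followed by a time unit. It is deliberately simple and will miss ranges
# such as "5 to 7 days" or spelled-out numbers such as "five days".
DURATION = re.compile(r"\d+(?:\.\d+)?\s*(?:day|week|hour)s?", re.IGNORECASE)

def durations_near_incubation(sentence):
    """Return duration-like strings, but only if the sentence mentions incubation."""
    if "incubation" not in sentence.lower():
        return []
    return DURATION.findall(sentence)

sample = "The mean incubation period was 5.2 days in this cohort."
print(durations_near_incubation(sample))
```

Starting this loose is intentional: the pattern can be tightened later, once real matches show which mistakes need filtering out.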
So we can go pretty fast and loose with our R&D here, whereas one thing I have learned, historically, is that if you are working with a very large dataset, you want an extra step first.
It's kind of a data preprocessing step.
What I would do is search for related diseases that you expect this one could be compared to, and then check whether any of them is being talked about anywhere around where we're about to pull an incubation time. If it is, forget about it.
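A rough sketch of that preprocessing filter might look like the following. The disease names in OTHER_DISEASES and the function name are hypothetical; the real list would need to be chosen with some care, since a name like "SARS" also appears inside "SARS-CoV-2".

```python
import re

# Hypothetical list of comparison diseases; adjust for the real dataset.
# Word boundaries keep "mers" from matching inside words like "customers".
OTHER_DISEASES = re.compile(r"\b(?:mers|influenza|ebola)\b", re.IGNORECASE)

def usable_incubation_sentences(sentences):
    """Keep sentences that mention incubation and no comparison disease."""
    kept = []
    for sentence in sentences:
        if "incubation" not in sentence.lower():
            continue
        if OTHER_DISEASES.search(sentence):
            continue  # probably describes another disease: forget about it
        kept.append(sentence)
    return kept

sentences = [
    "The incubation period was 5 days",
    "For MERS the incubation period was 6 days",
    "Most customers reported no symptoms",
]
print(usable_incubation_sentences(sentences))
```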
But for now, we're going to keep things very simple.
We're going to iterate over texts.
And then what we're going to say is: for sentence in t.split('. '), so splitting by period-space.
That way we don't split on things like decimals, right?
Let's print the sentence.
But in fact, let's say: if incubation is in the sentence, print the sentence. And actually, rather than breaking, let's print a few and just kind of see what we're dealing with.
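Written out, the loop described here could look roughly like this. The variable names texts, t, and sentence follow the narration; the sample strings and the wrapper function are made up for illustration.

```python
# Made-up stand-in for the list of full paper texts.
texts = [
    "Cases rose by 2.5 percent. The incubation period was about 5 days. Symptoms varied.",
    "No timing data was reported here.",
]

def incubation_sentences(texts):
    """Collect every sentence that contains the word 'incubation'."""
    found = []
    for t in texts:
        # Splitting on period-plus-space leaves decimals such as "2.5" intact.
        for sentence in t.split(". "):
            if "incubation" in sentence:
                found.append(sentence)
    return found

for sentence in incubation_sentences(texts):
    print(sentence)
```

Note that this only behaves as intended if texts is a list of strings; it is worth checking the type of texts before trusting the output.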
So let's run it.
But it doesn't split.
I'm a little confused about what we're seeing.
How are we seeing that?
That looks like a lot more text than I would expect to be seeing.
t.split... for sentence... for t in texts.
t was the full text, right?
Yeah.
For sentence in t.split by... huh?
Dogs are barking.
Hm?
Two news.
Why are we seeing that?
That is odd.
I would not expect to.
I have to pause for a moment, the dogs are going crazy.
This is bad timing, because what I don't understand is... okay.
Let's go figure out what my dogs were going on about.
I'll be back.
Okay.
To be honest, I have no idea.
I think we maybe got a package.
I'm not really sure, but, uh, I don't know.
Anyway, continuing along: yeah, we're still getting this, like, full text. I'm converting them with .values, which I believe is what I want.
I'm just a little confused: if incubation in sentence, for sentence in t.split, for t in texts.
Someone comment below and remind me of the basics of regular expressions, because I want this whole thing.
Possibly.
But for now, let's keep it simple.
Like I said, yeah.
You get to see how terrible I can be, because we want those two things, possibly, and I just don't know how.
There's got to be some way. I thought it was with parentheses, but I think I'm wrong, as we saw, because you'll get, like, that leading number, but not the following number.
And I think that's because the parentheses, like, pick that part to find.
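That guess about parentheses matches how Python's re.findall behaves: when the pattern contains a capturing group, findall returns only what the group matched instead of the whole match. A small sketch, with a made-up sentence and made-up patterns (the exact expression used in the video is not shown here):

```python
import re

sentence = "The incubation period ranged from 5 to 7 days."

# One capturing group: findall returns only the group, i.e. the leading number.
leading_only = re.findall(r"(\d+) to \d+ days", sentence)

# No groups: findall returns the whole matched text.
whole = re.findall(r"\d+ to \d+ days", sentence)

# A non-capturing group (?:...) groups without changing what findall returns,
# which makes the "to N" part optional while still giving the whole match.
optional_range = re.findall(r"\d+(?: to \d+)? days", sentence)

# Two capturing groups: findall returns a tuple per match, so both numbers survive.
both = re.findall(r"(\d+) to (\d+) days", sentence)

print(leading_only, whole, optional_range, both)
```

So to get "this whole thing", either drop the parentheses, switch to (?:...), or capture both numbers explicitly.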
As I look further and try to fix this program, I would look at each of these and see what we matched that gave us these.
Because I don't think those are real incubation times.
So we want to figure out where those came from, to determine how we can improve our script. But yeah, it looks like somewhere between five and... five, and, I don't know, there was that 75, and six days prior.
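One way to figure out where a suspicious number came from is to keep the source sentence next to every extracted value. The sketch below assumes a simple "N days" pattern, invented sample text, and an arbitrary 21-day cutoff for flagging values; none of these come from the video.

```python
import re

# Hypothetical pattern: capture the number that precedes "day" or "days".
DAYS = re.compile(r"(\d+(?:\.\d+)?)\s*days?", re.IGNORECASE)

def extract_with_context(texts):
    """Return (value_in_days, sentence) pairs for sentences mentioning incubation."""
    hits = []
    for t in texts:
        for sentence in t.split(". "):
            if "incubation" in sentence.lower():
                for value in DAYS.findall(sentence):
                    hits.append((float(value), sentence))
    return hits

# Invented example text, only to show the mechanics.
texts = [
    "The median incubation period was 5 days. Follow-up after incubation lasted 75 days.",
]

hits = extract_with_context(texts)
# Arbitrary cutoff: anything above 21 days gets printed for manual review.
suspicious = [(value, sentence) for value, sentence in hits if value > 21]
for value, sentence in suspicious:
    print(value, "<-", sentence)
```

Reading the flagged sentences by hand shows which patterns produce false matches, and that in turn suggests how to tighten the expression.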
Five and seven.
Was the mouse a seven?
It looks like the incubation time is probably 5 to 7 days, as the expected average.
But then, like I said, looking at the earlier ones, if we stacked these on top of each other, it's clearly sometimes more than 10, maybe even up to two weeks.
But on the dataset, anyway: it's a cool dataset, and it's obviously an important dataset.
I think it's a realistic set.
I mean, it's as real as it gets: this is a real problem that we're actually experiencing right now. And the dataset in general, this is a data mining problem.
So anyway, I think that's all. If you've got questions, comments, suggestions, concerns, whatever, feel free to leave them below.
Like I said, I would look through some of the kernels, maybe participate in some of the discussions and stuff, and you'll probably learn some really interesting things. So yep, that's all for now.
Hope you guys are staying safe, and I will see you guys in another video.
What's going on.