Subtitles section Play video Print subtitles GOOD MORNING, EVERYONE. GREETINGS OF THOSE JOINING US THREW THE LIVE FEED. THIS IS WEEK SEVEN OF THE COURSE IT'S MY PLEASURE TO INTRODUCE TO YOU DR. LYNN JORDE FROM THE UTAH SCHOOL OF MEDICINE. HER RESEARCH INTERESTS ARE IN THE AREA OF GENE MAP AND EVOLUTIONARY GENETIC. HIS GROUP IS ANALYZING VARIATIONS IN GENE IN THE PATHWAY WITH THE GOAL OF HOPING TO BETTER UNDERSTAND THE ROLE OF THESE GENES IN TO HYPERTENSION AND HIS WORK ALSO FOCUSES ON A NUMBER OF OTHER DISORDERS SUCH AS THE GENETICS OF JUVENILE IDIOPATHIC ARTHRITIS AND INFLAMMATORY BOWEL DISEASE. NUCLEAR DNA FOCUSING ON MOBILE ELEMENTS, THE GOAL OF THIS WORK IS TO UNDERSTAND BETTER THE GEOGRAPHICAL ORIGIN AND MIGRATION OF MAN AND HOW THESE DATA MIGHT BE USED TO DETERMINE THE RELEVANCE OF RACE, RACE IN QUOTATION MARKS IN BIOMEDICAL SETTINGS WHICH THE DOCTOR WILL TALK TO YOU ABOUT THIS MORNING. FINALLY, LYNN IS ONE OF THE MOST WONDERFUL LECTURERS THAT I KNOW I WAS VERY SURE THAT YOU'RE GOING TO ENJOY TODAY AS LECTURE AND LEARN A LOT FROM HIM THIS MORNING. TODAY'S LECTURE AGAIN IS INTENDED TO PROVIDE YOU AN OVERVIEW OF THE FIELD OF POPULATION GENETICS MY PLEASURE TO INTRODUCE TO YOU. [APPLAUSE] >> THANK YOU VERY MUCH. IT'S A PLEASURE TO BE BACK HERE AGAIN. THIS MORNING I WOULD LIKE TO INTRODUCE YOU TO THE TWO OF THE FIELD OF POPULATION GENETICS. MY TALK WILL BE DIVIDED IN THREE PARTS, AT THAT TIME EARNS -- PATTERNS OF HUMAN GENETICS BOTH AMONG HUMAN AND AMONG INDIVIDUALS WHICH WE CAN NOW LOOK AT WITH SOME PRECISION. WE'LL TALK ABOUT THE IMPLICATIONS OF THIS WORK FOR CONCEPTS OF RACE, SOMETHING THAT IS CONTROVERSIAL AND SOMETHING THAT I THINK IS ILLUMINATED BY OUR GENETIC STUDIES. THEN FINALLY IN THE THIRD PART OF THE TALK WE'LL DISCUSS HOW POPULATION GENETICS, EVOLUTIONARY GENETICS INFORMS OUR UNDERSTANDING OF THINGS LIKE LINKAGE TO DISEQUILIBRIUM, THE HAPMAP AND OUR CONTINUING SEARCH FOR GENES UPPED LYING DISEASE. WITH MUTATION. THE GENERATOR OF GENETIC VARIATION. WE ESTIMATE BASED ON PHYLOGENETIC ANALYSIS THAT THE HUMAN MUTATION RATE IS ABOUT 2.5 TIMES TO THE -- PER BASE PER PER GENERATION. WHAT THAT MEANS IS THAT WE TRANSMIT ABOUT 75 OR SO NEW DNA VARIANTS WITH EACH GAMETE. I SHOULD SAY THAT SOME OF THE NEW GENOME ANALYSES OF FAMILIES ARE SUGGESTING THAT THIS RATE IS OVERESTIMATED AND THAT THE ACTUAL MUTATION RATE MAY BE ABOUT HALF OF THIS. INTERESTING TO SEE AS THOSE STUDIES COME OUT WHERE THE MUTATION RATE FINALLY LANDS. BUT WE THINK NOW IT MIGHT BE ROUGHLY HALF OF THE USUALLY CITED PHYLOGENETIC ESTIMATE. HERE IS A QUOTE FROM LOUIS THAT I LIKE VERY MUCH. HE SAID THE CAPACITY TO BLUNDER SLIGHTLY IS THE REAL MARVEL OF DNA, WITHOUT THIS SPECIAL ATTRIBUTE WE WOULD STILL BE ANAEROBIC BACTERIA AND BE NO NUCLEUS US. THAT IS A LOVELY QUOTE. REMAINS US OF WHY WE SHOULD BE THANKFUL FOR OUR MUTATIONS. WELL, ONE OF THE QUESTIONS THAT WE CAN ASK AS WE LOOK AT DNA VARIATION IN INDIVIDUALS AND POPULATIONS SPECIE IS HOW MUCH AT THE DNA LEVEL DID WE DIFFER? IF WE LOOK AT ALIGNED DNA BASE DIFFERENCES, IDENTICAL TWINS BEING NATURE'S CLONES HAVE ZERO DNA SEQUENCE DIFFERENCES. A FAMOUS FIGURE NOW, IS THAT IF WE LOOK AT ANY TWO UNRELATED HUMANS, FOR ALIGNED DNA BASE DIFFERENCES WE VARY ABOUT ONLY ONE IN 1,000. SEVERAL TIMES MORE THAN THIS IF YOU INCLUDE COPY NUMBER VARIANTS BUT FOR ALIGNABLE SEQUENCE WE ARE AS HAS BEEN SAID MANY TIMES NOW 99.9%. IF WE COMPARE OURSELVES IN THE SAME WAY TOURER NEAREST NEIGHBOR, THE CHIMP WE DIFFER ABOUT 1 IN A HUNDRED BASE PAIRS FROM THE CHIMP. IN THE SENSE WE'RE 99% CHIMP. IF WE COMPARE OURSELVES TO MOUSE, ABOUT ONE IN THREE BASE DIFFERS COMPARES HUMANS AND MOUSE. THIS WE GO OUT FURTHER WE ARE DIFFERENT FROM BROCCOLI. BUT WHAT THIS MEANS IS THAT WE HAVE THREE BILLION DNA BASE PAIRS, THAT MEANS THERE ARE ABOUT THREE MILLION DIFFERENCES BETWEEN EACH PAIR OF HUMANS. SO A TREMENDOUS RESERVOIR OF GENETIC VARIATION THAT ACCOUNTS FOR THE DIVERSITY THAT WE SEE. HOW MUCH DO POPULATIONS DIFFER? THAT IS THE FIRST AREA THAT WE LOOK AT THIS MORNING. HERE WE SEE A MAP OF THE WORLD WITH THE POPULATION THAT I'LL BE TALKING ABOUT DESIGNATED. WE'VE BEEN TRYING TO SAMPLE MORE AND MORE EXTENSIVELY ACROSS THE WORLD AND IN COLLABORATION WITH THE FOUNDATION IN SALT LAKE CITY WHO HAVE COLLECTED MORE THAN 100,000 DNA SAMPLES FROM ALL OVER THE WORLD. I'M GO STEMMING YOU ABOUT VARIATION IN A THOUSAND INDIVIDUALS REPRESENTING 40 POPULATIONS. REALLY QUITE DIVERSITY. HERE ARE SOME OF THE PHOTOGRAPHS OF SOME OF THE TALKS I'LL BE TELLING YOU ABOUT. TO ASSESS VARIATION THE STANDARD APPROACH IS TO LOOK AT ALLELE FREQUENCIES. IF WE IMAGE IN THAT WE HAVE THREE SNPS SINGLE NUCLEOTIDE POLYMORPHISMS THAT WE'RE ANALYZING. WE CAN SEE THAT THERE ARE DIFFERENCES IN THE FREQUENCIES OF THESE THREE SNPS IN POPULATIONS ONE, TWO AND THREE. WE LOOK AT THOSE DIFFERENCES, WE LOOK AT THAT VARIATION TO ASSESS PATTERNS OF SIMILARITY AMONG POPULATIONS. THIS IS ONE OF TWO EQUATIONS THAT I'LL SHOW YOU. I'M NOT GOING TO TORTURE YOU WITH MATHEMATICS THIS MORNING THIS IS PRETTY SIMPLE. IT SHOWS HOW WE ESTIMATE A STATISTIC CALLED FST TO ASSESS VARIATION BETWEEN POPULATION. SO, FST IS THE AMOUNT OF GENETIC VARIATION THAT IS DUE TO POPULATION DIFFERENCES. AND WE GET IT BY LOOKING AT THE TOTAL HETEROZYGOCITY. TOTAL VARIATION ACROSS OUR SAMPLES, THAT'S H SUB2. FROM THAT TOTAL WE SUBTRACT THE AVERAGE HETEROZYGOCITY WITHIN EACH SUBDIVISION. IF FST IS ZERO ALL OF THE VARIATION THAT WE AB EVERYBODY DAB DAB OBSERVE ARE IN POPULATIONS, THE AVERAGE WITHIN SUBDIVISIONS IS EQUAL TO THE TOTAL. THERE'S NO DIFFERENCE BETWEEN POPULATION. THE OTHER HAND IF FST IS ONE, THEN ALL OF THE VARIATION EXISTS BETWEEN POPULATIONS. ONLY WAY WE CAN GET THIS WOULD BE FOR H TO BE ZERO. IN OTHER WORDS ALL OF OUR POPULATION OR IN THIS CASE CONTINENTS WOULD CONSIST OF IDENTICAL CLONE. NO VARIATIONS WITHIN SUBDIVISIONS. FST OF ONE, MAXIMUM VARIATION BETWEEN SUBDIVISIONS, FST OF ZERO, NONE. IF WE LOOK AT THIS STATISTIC IN SERIES OF POLYMORPHISMS IN THE SAMPLES, IF WE LOOK AT STRS, RESTRICTION SITE POLYMORPHISMS AND L1S AND 250K SNP ARRAY WE SEE THAT THE FST VALUE FOR OUR HUMAN POPULATION IS PRETTY CONSISTENTLY SOMEWHERE BETWEEN 10-15%. THIS WE SEE IN MANY STUDIES DIFFERENT KINDS OF DNA POLYMORPHISMS, DIFFERENT SETS OF POPULATIONS, BUT TYPICALLY ROUGHLY 10 TO 15% OF GENETIC VARIATION CAN BE ASCRIBED TO DIFFERENCES BETWEEN THESE MAJOR POPULATIONS. THAT'S RELATIVELY SMALL AMOUNT. TELLING US THAT THERE ISN'T THAT MUCH VARIATION BETWEEN HUMAN POPULATIONS OR THESE LARGELY NEUTRAL DNA VARIANT. WE CAN COMPARE THAT WITH SKIN PIGMENTATION. , WE SEE THE OPPOSITE PATTERN. 90% OF VARIATION EXISTS BETWEEN POPULATION. VERY DIFFERENT PATTERN FROM WHAT WE LOOK AT ACTUAL DNA VARIATION. THESE ARE TRAITS, THIS IS STRONGLY SELECTED IN HUMAN POPULATION FOR DIFFERENT AND DIVERGENT. IF WE LOOK AT THE THREE HAPMAP POPULATIONS OF THE EUROPEAN DERIVED POPULATIONS, THE TWO ASIAN POPULATIONS WE JUST LOOK AT THOSE, THAT'S A LIMITED SAMPLE OF HUMAN DIVERSITY. GIVES US A HIGH FST VALUE ABOUT 15%. IF WE START TO SAMPLE MORE POPULATIONS OF FST TENDS TO GO DOWN THAT IS LESS VARIATION BETWEEN POPULATIONS AS WE SAMPLE MORE OF THE WORLD'S DIVERSITY. HERE LOOKING AT 27 POPULATIONS PST HAS GONE DOWN. THE IMPORTANCE IMPORTANT AS WE SAMPLE MORE EVENLY ACROSS THE GLOBE THIS LEVEL OF DIFFERENTIATION TENDS TO GO DOWN. IT'S NOT GOING TO ZERO BECAUSE OF COURSE THERE IS VARIATION AMONG HUMAN POPULATION. BUT IMPORTANTLY IN THE OVERESTIMATED IF WE SAMPLE SELECTIVELY. ASK, WHAT PROPORTION OF SNP ARE SHARED AMONG POPULATIONS. HERE WE'RE LOOKING AT COMMON SNPS FROM THE ENCODE DATABASE WHERE THE MINOR ALLELE FREQUENCY IS GREATER THAN 5%. ABOUT 80% OF THE MINOR ALLELE ARE SHARED AMONG THE THREE MAJOR CONTINENTAL POPULATIONS. FEWER THAN 1% ARE SEEN JUST IN ASIAN POPULATION AND FEWER THAN 1% ARE RESTRICTED JUST TO EUROPEAN POPULATION. ABOUT 6% ARE SPECIFIC TO AFRICAN POPULATION. MORE DIVERSITY INNERVE CAN AND DIVERSITY OUTSIDE OF AFRICA BEING LARGELY SUBSET OF WHAT WE SEE IN AFRICA. THIS PAPER JUST CAME OUT TWO, COMPLETE AFRICAN GENOMES. YOU CAN SEE AGAIN COMPARING THOSE GENOMES, NOW LOOKING AT WHOLE GENOME SEQUENCES SO THIS INCLUDES NOT JUST COMMON VARIANTS BUT ALSO RARE VARIANTS, WE CAN SEE THAT STILL AS WE'RE COMPARING THE INDIVIDUALS THERE'S STILL A LOT OF SHARING OF VARIANTS AMONG THESE INDIVIDUALS EVEN LOOKING AT RARE VARIANTS. NOT AS MUCH SHARING WHEN WE'RE LOOKING AT COMMON, BECAUSE THE COMMON VARIANTS TEND TO BE OLDER MORE LIKELY TO BE SHARED. BUT STILL INTERESTING LEVEL OF DNA SHARING AMONG THESE INDIVIDUALS FOR WHOM WE NOW HAVE COMPLETE GENOME SEQUENCE. HOW DO WE ASSESS GENETIC DISTANCE DIFFERENCES BETWEEN POPULATION. WE CAN DO SIMPLE GENETIC DISSTANCES. WE DEFINE THE DISTANCE BETWEEN TWO POPULATIONS, CALL THEM I AND J BY THE DIFFERENCE IN ALLELE FREQUENCIES. PIECE OF I AND PIECE OF J ARE THE ALLELE FREQUENCIES IN THE TWO POPULATIONS. WE GO BACK TO THE FREQUENCIES I SHOWED YOU. WE HAVE THREE. WE'RE LOOKING AT THREE SNPS. IF WE WANT TO ASSESS THE DISTANCE BETWEEN POPULATIONS ONE AND TWO WE CAN SIMPLY FOR THIS SNP, SNP ONE, SUBTRACT THE DIFFERENCE IN THEIR FREQUENCIES IN POPULATIONS 1 AND 2 THAT'S VERY SIMPLE GENETIC DISTANCE ESTIMATE. THEN TO GET THE OVERALL DISTANCE BETWEEN POPULATIONS 1 AND 2 WE WOULD JUST AVERAGE THIS DISTANCE WITH THE OTHER -- DISTANCES DERIVED FROM THE OTHER TWO SNPS. IT'S REALLY PRETTY SIMPLE. ON AVERAGE HOW DIFFERENT ARE THESE POPULATIONS IN TERMS OF THEIR ALLELE FREQUENCY F. THAT WE CAN BUILD A NETWORK OF SIMILARITY AMONG POPULATIONS. WE HAVE OUR THREE POPULATIONS, WE HAVE OUR DISTANCE. WE CAN DRAW A NODE BETWEEN THOSE TWO POPULATIONS, THEN WE CAN TAKE THE AVERAGE OF SAYS SUBTRACT THAT FROM THE FREQUENCY IN THE THIRD POPULATION THAT GIVES US ANOTHER NODE IN OUR NETWORK. THAT THE HOW WE SHOW THE THEM IN TERMS OF ALLELE FREQUENCIES. JUST BY LOOKING AT THEM ARE A LITTLE MORE SIMILAR TO EACH OTHER THAN THEY ARE TO POPULATION THREE. THAT'S WHAT THE NETWORK DISPLAYS FOR US F. WE DO THIS NOW FOR SERIES OF POPLY MORPHISMS NOW LOOKING AT ALU INSERTION POLYMORPHISMS. THOSE ARE SHORT INTERSPERSED NUCLEAR ELEMENTS THAT INSERTED RECENTLY IN TO THE HUMAN GENOME SOME PEOPLE WILL HAVE AN ALU AT SPECIFIC CHROMOSOME LOCATION, OTHERS DON'T. THEN WE'RE ASSESS THE FREQUENCIES OF THOSE, IN THIS CASE 100ALU POLYMORPHISMS, WORK WE DID A FEW YEARS AGO AND WE'RE LOOKING AT VARIOUS HUMAN POPULATION. YOU SEE SOME INTERESTING PATTERNS HERE, FIRST OF ALL YOU SEE THAT POPULATION DO TEND TO GROUP TOGETHER ACCORDING TO THEIR CONTINENT OF ORIGIN. THIS ISN'T REALLY A SURPRISE, THEY ARE MORE LIKELY TO HAVE COMMON HISTORY THERE IS A CORRELATION BETWEEN ANCESTRY AND GEOGRAPHIC LOCATION. SO, WE CAN SEE POPULATIONS FROM MAJOR CONTINENTS, ESSENTIALLY GROUPING TOGETHER. THE OTHER THING WE NOTICE HERE IS THAT THERE'S A LOT MORE DIVERSITY AMONG THE AFRICAN POPULATION THAN IN THE REST OF THE WORLD. WE'LL COME BACK TO THAT. IF I WOULD ASSESS THE STATISTICAL SIGNIFICANCE OF THESE RESULTS, THEE ARE BOOTSTRAP SUPPORT LEVELS THEY'RE VERY HIGH TELLING US THAT THESE GROUPINGS HAVE SUBSTANTIAL STATISTICAL SUPPORT. A SIMILAR NETWORK ONLINE ONE INSERTION POLYMORPHISMS WE SEE THE SAME PATTERN. NETWORK BASED ON A 250K ARRAY, AGAIN WE SEE THAT SAME PATTERN. HERE ARE SERIES OF AFRICAN POPULATIONS, EUROPEAN POPULATIONS, SERIES FROM SOUTH ASIA, EAST ASIA. AND NEW WORLD OVER HERE IN YELLOW. SO THESE POPULATIONS DO TEND TO GROUP TOGETHER ACCORDING TO THEIR GEOGRAPHIC LOCATION. NOW ADDED THE HDGP SAMPLES. ANOTHER 40 POPULATIONS WE SEE AGAIN THE SAME PATTERN. IT'S REALLY QUITE ROBUST. THIS IS ANOTHER ANALYSIS PUBLISHED IN "NATURE" A COUPLE YEARS AGO BY ANOTHER GROUP, SOMEWHAT DIFFERENT SAMPLE OF POPULATIONS LOOKING BOTH AT CNBS AND SNPS AND WE SEE THE SAME PATTERNS, WE ASSURING LEVEL OF CONSISTENCY ACROSS STUDIES. IF WE PLOT HAPLOTIGHT HETEROZYGOCITY. WE'RE SAYING, HOW MUCH VARIATION IN HAPPEN LAY TIGHT IS THERE ACROSS THE -- HAPLOTIGHT ACROSS THE WORLD AS IT RELATES IN DISANSWER FROM EAST AFRICA. WHAT WE AUTO IS A HAPLOTIGHT DIVERSITY STEADY DECLINES AS WE GO FURTHER AWAY FROM. FIRST AROSE IN AFRICA THAT IS AN TOPICALLY MODERN HUMAN, PEOPLE THAT LOOK LIKE US AROSE ROUGHLY 200,000 YEARS AGO STAYED IN AFRICA AT LEAST 100O YEARS, DEVELOPED -- 100,000 YEARS, DEVELOPED VARIATION THEN SMALL SUBSET WENT OUT TO COLONIZE THE REST OF THE WORLD. AS A RESULT OF THAT VARIATION IN THE REST OF THE WORLD TENDS TO BE LESS THAN IN AFRICA TENDS TO BE SUBSET OF WHAT WE SEE IN AFRICA. ALL VERY CONSISTENT WITH THE RECENT AFRICAN ORIGIN OF OUR SPECIES AND A COMMON ORIGIN OF OUR SPECIES. THIS IS A DIFFERENT TAKE ON HUMAN ORIGINS. I WAS IN THE SUPERMARKET A FEW YEARS AGO AND MY EYES WERE CAUGHT BY THIS HEADLINE "ADAM AND EVE SKELETON HAD BEEN STOLEN" I WASN'T AWARE THEY HAD BEEN STOLEN, BECAUSE I WAS PROMISED MORE AMAZING PHOTOS INSIDE, I HAD TO BUY IT. WHAT I DISCOVERED, ALL THAT'S LEFT IS EVE'S LEGS AND THE IDENTITY OF THE PERPETRATOR MAY HAVE BEEN ESTABLISHED. INEVITABLY IF WE'RE TALKING ABOUT DIFFERENCES AMONG POPULATIONS, AMONG INDIVIDUALS, THE ISSUE OF RACE COMES UP. WHAT DOES GENETICS NOW TELL US ABOUT TRADITIONAL CONCEPTS OF HUMAN RACE. I THINK WHAT YOU'LL SEE IS THAT OUR VIEW OF RACE MUCH MORE COMPLEX AS WE BEGIN TO LOOK AT GENETIC DATA. FIRST WE CAN ASK THE QUESTION, WHY DOES RACE EVEN MATTER? WHY DOES IT KEEP COMING UP IN OUR DISCUSSION? WELL, CERTAINLY THE PREVALENCE OF MANY DISEASES IS KNOWN TO VARY BY POPULATION AND ALONG LINES THAT CORRESPOND TO TRADITIONAL RACIAL DESIGNATIONS, THINGS LIKE PROSTATE CANCER, HYPERTENSION. WE KNOW THAT SOME RELATIVELY COMMON DISEASE PREDISPOSING VARIANTS VARY AMONG POPULATIONS. THINGS LIKE CLOTTING FACTOR V L,IDEN VARIANT. SUBSTANTIALLY MORE COMMON AMONG EUROPEANS THAN AMONG OTHER INDIVIDUALS. THERE'S EVIDENCE RESPONSE TO SOME DRUGS MAY VARY AMONG POPULATION. AFRICAN-AMERICANS MAY ON AVERAGE BE LESS RESPONSE SIEVE TO ACE INHIBITORS, BETA BLOCKERS AND I EMPHASIZE "MAYBE" BECAUSE WE COME BACK TO THAT POINT AND WHAT THAT MEANS. IN THE FORENSIC DATABASES THAT ARE COMMONLY USED BY THE FIB AND BY CRIME LABS THEY ARE GROUPED ALONG TRADITIONAL, QUOTE, RACIAL, LINES. THESE DESIGNATIONS ARE USED COMMONLY IN MANY WAYS. THE QUESTION IS, WHAT CAN GENETICS TELL US ABOUT THEIR VALIDITY. IT'S INTERESTING TO LOOK AT SOME COMMENTS OVER THE LAST DECADE ON RACE. THERE WAS EDITORIAL IN THE "NEW ENGLAND JOURNAL" THAT ASSERTED THAT RACE IS BIOLOGICALLY MEANINGLESS. IN A RESPONSE TO THE "NEW YORK TIMES" A PSYCHIATRIST DR. SATEL RESPONDED "I AM A RACIALLY PROFILING DOCTOR." DELIBERATELY PROVOCATIVE COMMENT, POINT WAS THAT SHE USES POPULATION AFFILIATION TO HELP DECIDE DOSAGE AND DRUGS FOR HER PATIENTS. THE STATEMENT A FEW YEARS FROM THE AMERICAN ANTHROPOLOGIC ASSOCIATION THAT "ANY TWO INDIVIDUALS WITHIN A PARTICULAR POPULATION ARE AS DIFFERENT GENETICALLY AS ANY TWO PEOPLE SELECTED FROM ANY TWO POPULATIONS IN THE WORLD." WE'LL SEE WHAT THE GENETIC DATA ACTUALLY DO TELL US. WHEN THERE ARE SO MANY DIVERGENT OPINIONS ON AN ISSUE IT'S TIME TO LOOK AT DATA. THE COVER OF "SCIENTIFIC AMERICAN" A FEW YEARS AGO ASKED, DOES RACE EXIST? AND THIS IS THE PART THAT CAUGHT MY ATTENTION. SCIENCE HAS THE ANSWER. ANY TIME AS A SCIENTIST I THINK WHEN WE SEE THAT "SCIENCE HAS THE ANSWER" WE GET SKEPTICAL. LET'S LOOK AT THE DATA. WE TABULATE DNA SEQUENCES AMONG INDIVIDUALS. NOW INSTEAD OF LOOKING AT POPULATION WE LOOK AT INDIVIDUALS. WE CAN PICK A FEW INDIVIDUALS WHOSE DNA SEQUENCE WE HAVE HYPOTHETICALLY OBTAINED. WE'RE LUBING AT GEORGE BUSH, JOHN MCCAIN, HILLARY CLINTON I COULDN'T RESIST PUTTING IN JOHN EDWARDS. I'M NOT SURE IF ANYONE WANTS TO SHARE DNA WITH JOHN EDWARDS THESE DAYS BUT WE CAN COMPARE THEIR SEQUENCES. SO AS WE'RE MAKING THESE DNA NETWORKS WHAT WE'RE INTERESTED IN IS SEQUENCE DIFFERENCE AMONG OUR PAIRS OF INDIVIDUALS. IF WE COMPARE BUSH AND MCCAIN, WE SEE THAT THERE ARE TWO SEQUENCE DIFFERENCES. WE PUT A TWO IN OUR MATRIX BETWEEN BUSH AND MCCAIN F. WE COMPARE BUSH AND CLINTON WE SEE THAT THERE ARE FIVE DIFFERENCES. WE PUT A FIVE HERE. BUSH AND EDWARDS, SIX. EDWARDS AND MCCAIN, FOUR. SO FORTH. WE MAKE A MATRIX OF DNA DIFFERENCES AMONG OUR PAIRS OF INDIVIDUALS. FROM THAT WE CAN MAKE A DIAGRAM OR TREE THAT SHOWS HOW SIMILAR THEY ARE. THIS IS HYPOTHETICAL. IT GIVES YOU DISPLAY THEN OF DISTANCES -- DIFFERENCES AMONG OUR PAIRS OF INDIVIDUALS. IF WE'RE LOOKING AT JUST A FEW PEOPLE WE CAN EASILY LOOK AT THE MATRIX ITSELF AND SEE THE PATTERN. BUT IMAGINE LOOKING AT HUNDREDS OR EVEN THOUSANDS OF INDIVIDUALS THEN IT BECOMES MUCH MORE DIFFERENT BY LOOKING AT THE MATRIX OF A THOUSAND BY A THOUSAND. THESE DISPLAYS HELP US TO SEE THE PATTERN VERY EASILY. STEVE WHO WORKS WITH US FEW YEARS AGO SAW THIS MATRIX IN THE "NEW YORK TIMES," MATRIX OF PERCENT DISAGREEMENT AMONG THE NINE SUPREME COURT JUSTICES. HE WAS LEARNING POPULATION GENETICS AT THIS TIME HE THOUGHT THIS WAS GOOD EX ARE SIGHS. IF YOU LOOK AT THIS MATRIX YOU CAN SEE SOME PATTERNS, YOU CAN SEE FOR EXAMPLE THAT JUSTICES THOMAS AND SCALIA HAVE ONLY 9% DISAGREEMENT. PRETTY SIMILAR. BUT STILL IT'S NOT SO EASY TO DEDUCE THE WHOLE PATTERN UNTIL YOU MAKE A DIAGRAM. A TREE THEN YOU CAN SEE THE PATTERN VERY EASILY. WE HAVE THE CONSERVATIVE WING OF THE COURTS HERE. THE OTHER WING OF THE COURT OVER HERE SHOWING UP VERY NICELY ON THIS DISPLAY. IF WE DO THE SAME THING AT AT DNA, WE'RE LOOKING A THE ANGIOTENSINOGEN GENE. WE LOOKED AT 14BK, HOW SIMILAR BE THE MEMBERS OF THESE CONTINENTAL POPULATIONS, ASIAN, EUROPEANS AND AFRICANS. WHAT WE'VE SEEN IS THAT SOMETIMES AN INDIVIDUAL FROM AFRICA IS ACTUALLY MORE SIMILAR TO PEOPLE FROM ASIA OR PEOPLE FROM EUROPE THAN TO OTHERS FROM AFRICA. WHEN WE'RE LOOKING AT THIS SINGLE GENE. LOOKING AT 14KB OF SEQUENCE. WHAT THAT IS REFLECTING IS THE MIXED ANCESTRY OF INDIVIDUALS WITH REGARD TO SPECIFIC GENES. OUR COMPLEX HISTORY OF MIGRATION AND MIXED, WE SEE WITHIN WE LOOK AT HUMANS, WE SEE GENES FROM EUROPE AND AFRICA. WE SEE GENES FROM ASIA AND EUROPE. WE HUMANS DO HAVE A MIXED AND VERY COMPLEX HISTORY OF MIGRATION THERE. IS NO SUCH THING AS A QUOTE, PURE, HUMAN POPULATION. GENETIC DATA TELLS US THAT VERY CLEARLY. CARL DARWIN SAID THAT IT MAY BE DOUBTED WHETHER ANY CHARACTER CAN BE NAMED WHICH IS DISTINCTIVE OF A RACE AND IS CONSTANT. SO, DARWIN WAS WELL AWARE OF IT. THAT CHARACTERS THAT HE WAS LOOKING AT TENDED TO VARY IN FREQUENCY AMONG POPULATION BUT SELDOM COULD YOU DEFINE POPULATION BASED ON ANY GIVEN CHARACTERISTIC. NOW, HERE IS WHAT WE DID A FEW YEARS AGO WAS TO LOOK AT LARGER NUMBER OF VARIANTS, IN THIS CASE, ALU, STR AND RESTRICTION SITE POLYMORPHISMS, CLOSE TO 200 POLYMORPHISMS ASKED THE SAME QUESTION. HOW SIMILAR ARE THESE INDIVIDUALS IF NOW INSTEAD OF LOOKING AT SEQUENCE FROM ONE GENE WE'RE LOOKING AT 190 INDEPENDENT VARIANTS. NOW WE SEE THAT THERE IS SOME PATTERN HERE WHERE OUR SAMPLES FROM ASIA TEND TO GROUP TOGETHER FROM, EUROPE GROUP TOGETHER, AFRICA, GROUP TOGETHER, WE'RE USING A LOT MORE INFORMATION WE'RE PICK UP MORE ABOUT THE ANCESTRIES OF -- ANCESTRYS OF THESE INDIVIDUALS NOTICE THAT THESE BRANCHES MOST OF THE BRANCH LENGTH IS SEEN WITHIN POPULATION. THIS IS CONSISTENT WITH THE FST STATISTICS THAT SAID MOST OF THE VARIATION WE SEE, WE SEE WITHIN MAJOR POPULATIONS. BUT THERE IS ENOUGH VARIATION JUST ENOUGH BETWEEN POPULATIONS SO THAT IF WE'RE LOOKING AT A LOT OF CHARACTERS WE CAN BEGIN TO SEE A REFLECTION OF PARTIAL ISOLATION OF THESE POPULATIONS THROUGH THEIR HISTORY. THE ANALOGY I LIKE TO USE IS LET'S SAY HEIGHT F. WE ONLY LOOK AT HEIGHT, WE'RE GOING TO SEE QUITE A LOT OF OVERLAP BETWEEN OUR MALE AND FEMALE POPULATION. IF WE ADD ANOTHER CHARACTERISTICS, LET'S SAY WAIST/HIP RATIO NOW WE HAVE MORE INFORMATION THAT ALLOWS US TO DISCERN MALES VERSUS FEMALES. AND SO THERE'S LESS OVERLAP BETWEEN THEM. THAT'S WHAT WE'RE DOING AS WE LOOK AT MORE GENETIC CHARACTERS WE'RE LEARNING MORE ABOUT ANCESTRYS OF THESE INDIVIDUALS, STARTING TO SEE MORE OF THE NONOVERLAPPING PART OF THOSE CIRCLES I SHOWED YOU. IF WE DO THIS WITH A LARGER NUMBER OF CHARACTERS, WE ARE USING A SNP ARRAY WE START TO SEE A PATTERN IN THIS DIAGRAM OF INDIVIDUALS. DON'T WORRY ABOUT READING THE LABELS, THEY CORRESPOND TO POPULATIONS SO THAT IF WE USE LARGE NUMBER OF SNPS ACROSS THE HUMAN GENOME WE START TO SEE INDIVIDUALS SORTING ACCORDING TO THEIR POPULATION OF ORIGIN. NOW, SHY POINT OUT THESE POPULATIONS ARE PRETTY WELL SEPARATED FROM EACH OTHER GEOGRAPHICALLY. BUT WE DO SEE, FOR EXAMPLE, AFRICAN POPULATIONS HERE, EUROPEAN POPULATION, SOUTH INDIAN POPULATION. SOUTH PACIFIC, NEW WORLD, ASIAN, SO FORTH. IS THERE ORDER OF ANCESTRY IF WE LOOK AT LOTS OF INDEPENDENT DNA CHARACTERS. RECENTLY MY COLLEAGUE IN MY DEPARTMENT LOOKED AT THE WHOLE GENOME SEQUENCE DATA, NOW PUBLICLY AVAILABLE FOR TEN INDIVIDUALS. IF WE LOOK AT WHOLE SEQUENCE DATA WE SEE THAT AGAIN INDIVIDUALS TEND TO SORT OUT ACCORDING TO CONTINENT OF ORIGIN. WHICH IS NOT TOO SURPRISING IF WE DOCK THIS WITH 10K -- 10,000 SNPS WE EXPECT IF WE LOOK AT WHOLE SEQUENCE DATA WE'RE GOING TO SEE A SIMILAR PATTERN. NOW, ANOTHER THING THAT MARK FOUND THAT I THINK IS PRETTY INTERESTING IS THAT THERE IS SOME VARIATION DEPENDING ON THE SEQUENCING PLATFORM THAT IS USED. THESE ARE THE SAME AFRICAN SAMPLE BUT LOOK DIFFERENT DEPENDING WHERE -- THERE WERE 557,000 DIFFERENCES BETWEEN THEM GENERATED BY DIFFERENCES IN PLATFORMS. ALTHOUGH THERE IS CONSISTENCY HERE WITH ANCESTORS, THERE IS ALSO PLATFORM VARIATION THAT WE SEE IN OUR DIAGRAM. YOU MIGHT SAY, WELL, WHAT IS WHOLE GENOME SEQUENCE TELLING US THAT WE CAN'T GET WITH JUST A SAMPLE OF SNP. THIS IS VERY NEW STUDY, JUST CAME OUT WHERE WE WERE ABLE, ONE OF MY POST DOCS WAS ABLE TO USE THE PRESENCE OF ALU INSERTIONS IN WHOLE SEQUENCE DATA TO MARK REGIONS OF THE GENOME THAT ARE ANCIENT. HE COMPARED THE PUBLISHED NIH SEQUENCE WITH THE HUREF. THE IDEA HERE IS THAT IF WE LOOK AT WHOLE SEQUENCE VARIATION, THE AVERAGE TIME FOR ANY GENOMIC REGION ACROSS THESE TWO GENOMES THEY WOULD HAVE COMMON ANCESTOR ABOUT 460,000 YEARS AGO. BUT, BECAUSE ALU INSERTIONS ARE RARE EVENTS, ONLY OCCUR IN ONE IN 20 BIRTHS, THEY ARE TEND TO BE PRESERVED JUST IN VERY OLD REGIONS OF THE GENOME. BECAUSE THEY ARE SO RARE, IN OTHER WORDS, BETWEEN MY BROTHER AND ME THERE'S VERY LITTLE CHANCE THAT WE HAVE -- THAT I WOULD HAVE ALU INSERTION ON SPOT ON CHROMOSOME FIVE THAT HE DOESN'T HAVE. BECAUSE THESE ARE RARE EVENTS THEY MARK ANCIENT PARTS OF THE GENOME WHERE ACTUALLY THE AVERAGE COALESCENESS TIME IS ABOUT 900,000 YEARS. FOR THESE REGIONS OF THE GENOME, REGIONS IN WHICH AN ALU HAS INSERTED THEY TEND TO BE VERY OLD WE CAN LOOK AT SEQUENCE VARIATION IN THOSE REGIONS, TO PROBE VERY ANCIENT HISTORY IN OUR SPECIES AND IN OUR ANCESTORS. WHAT CHAD WAS ABLE TO DO, THIS PAPER WAS JUST PUBLISHED IN JANUARY, ACTUALLY THEY ESTIMATE THE EFFECTIVE POPULATION SIZE OF HUMAN ANCESTORS, 1.2 MILLION YEARS AGO. WHAT'S INTERESTING IS THAT THAT ESTIMATE IS ONLY ABOUT 18,000. WE AT THAT TIME OUR ANCESTORS PREHUMAN ANCESTORS HAD VERY SMALL POPULATION. EFFECTIVE SIZE OF ANATOMICALLY MODERN HUMANS IS QUITE SMALL BUT THAT SUGGESTS THAT OUR ANCESTORS WERE FAIRLY CLOSE TO EXTINCTION AT ONE POINT IN THEIR HISTORY. WHAT I FIND REMARKABLE IS THAT WE CAN LEARN THIS, LEARN IT WITH QUITE A LOT OF PRECISION FROM JUST TWO HUMAN DNA SEQUENCES. BECAUSE WITH WHOLE GENOME SEQUENCE WE HAVE A LOT OF INFORMATION. WE CAN MAKE ESTIMATES LIKE THIS THAT WOULD BE IMPOSSIBLE WITHOUT THAT MUCH INFORMATION, WITHOUT THAT MUCH VARIATION IS LOOK AT. SO, LOT OF INTERESTING THINGS THAT WE DOCK WITH THESE WHOLE GENOME SEQUENCES AND WE'RE VERY HAPPY THAT THEY ARE PUBLICLY AVAILABLE SO THAT WE CAN ALL LOOK AT THEM AND THINK OF INTERESTING THINGS TO DO WITH THEM. HERE IS ANOTHER WAY OF LOOKING AT GENETIC DISTANCES AMONG POPULATIONS AT SIMILARITIES AND DIFFERENCES. THIS IS CALLED A PRINCIPLE COMPONENT ANALYSIS. USED VERY COMMONLY IN POPULATION GENETICS. I WON'T GO IN TO DETAILS ABUT THIS, BUT BASICALLY WHAT IT DOES IS DISPLAYS DIFFERENCES AMONG OUR INDIVIDUALS, IN THIS CASE IN THREE DIMENSION. THEN THIRD ONE THIS IS KIND OF THE THIRD DIMENSION -- THE POINT SHEER THAT IF WE LOOK AT ONLY TEN SNPS, WE LOOK AT SMALL COLLECTION OF VARIATION ACROSS THE GEE NO, MA'AM WE CAN'T REALLY SEE -- GENOME, WE CAN'T SEE MUCH OF A PATTERN, WOE DON'T HAVE VERY MANY CHARACTERS, THERE'S A LOT OF OVERLAP. IF WE LOOK AT 100 SNPS IN THE SAME INDIVIDUALS THIS SAME KIND OF DISPLAY WE SEE A PATTERN BUT STILL GREAT DEAL OF OVERLAP NOT REALLY A DISCERNIBLE PATTERN HERE IN THREE DIMENSIONS. IF WE LOOK AT A THOUSAND SNPS WE START TO SEE SOME DISCERNIBLE PATTERN. IN FACT THESE GROUPS CORRESPOND TO THE MAJOR CONTINENTAL POPULATION. 23 WE LOOK AT 250,000 SNPS THERE'S EVEN MORE OF A PATTERN. WE CAN BEGIN TO SEE INDIVIDUALS SORTING IN TO POPULATIONS, FROM INDIA HERE, ASIAN POPULATIONS, EUROPEAN, AFRICAN POPULATIONS. AGAIN WITH LOTS OF INFORMATION WE CAN START SEE INDIVIDUALS SORTING IN TO POPULATIONS OF ORIGIN. THE HAPMAP FITS WHERE WE WOULD EXPECT THEM TO. IF YOU TRANSLATE THESE DIMENSIONS THE FIRST ONE IS SORT OF AFRICA VERSUS NON-AFRICA, SO THIS IS SORT OF THE OUT OF AFRICA DIMENSION OF THIS PLOT. THE SECOND ONE IS PRETTY MUCH EAST-WEST ACROSS THE OLD WORLD AND THIRD ONE IS NORTH-SOUTH ORIENTATION. IT GIVES US THREE DIMENSIONS OF HUMAN GENETIC VARIATION. THIS IS SIMILAR PLOT LOOKING AT 850 INDIVIDUALS IN TWO DIMENSIONS WE HAVE ESSENTIALLY GOING FROM WEST TO EAST. AGAIN WE SEE INDIVIDUALS SORTING OUT BUT I ALSO WANT TO POINT OUT THAT THERE IS OVERLAP. WE CAN'T, ESPECIALLY NOW THAT WE'RE SAMPLING MORE BROADLY COULDN'T DRAW SHARP BOUNDARY AMONG THESE VARIOUS POPULATIONS. THAT'S VERY IMPORTANT POINT. HERE WE'RE LOOKING AT EURASIA ALONE. YOU START SEE A MAP OF THE WORLD AS YOU LOOK AT THESE POPULATIONS, AS YOU LOOK AT THEIR GENETIC SIMILARITIES FOR 250,000 SNPS. IMPORTANT POINT SHEER THAT IF WE LOOK AT MULTIPLE POLYMORPHISMS, WE LOOK AT 10,000 OR 100,000 OR A MILLION POLYMORPHISMS WE CAN WITH SOME ACCURACY PREDICT POPULATION AFFILIATION BECAUSE THESE SNPS, IF THERE ARE MANY OF THEM, ARE AS I SAID TELLING US ABOUT THESE NONOVERLAPPING PARTS OF THE CIRCLES. BUT WHAT WE CAN'T DO, THIS IS A CRITICAL POINT, WE CAN'T GO THE OTHER WAY. BECAUSE THESE SNPS VARY JUST IN FREQUENCY AMONG POPULATIONS. WE CAN'T BY LOOKING AT A POPULATION AFFILIATION WE CAN'T INFER WHAT SOMEONE'S SNP ALLELE IS GOING TO BE, OR CAN'T FROM A SINGLE SNP INFER POPULATION AFFILIATION. THAT I THINK IS A CRITICAL POINT. WELLF WE HAVE ENOUGH GENETIC INFORMATION COULD WE CLASSIFY EVERYBODY IN TO A POPULATION. LET'S LOOK AT THAT NETWORK THAT WE VIEWED JUST A FEW MINUTES AGO, LET'S ADD AFRICAN-AMERICAN. SOME INDIVIDUALS GROUP IN TO THIS GROUP DOWN HERE WITH PEOPLE FROM AFRICA. OTHERS DON'T REALLY FALL IN TO A GROUP. IF WE LOOK AT PUERTO RICANS, ANOTHER GROUP WITH A COMPLEX HISTORY, SOME TEND TO FALL IN WITH PEOPLE FROM SPAIN, OTHERS CLOSER TO PEOPLE FROM AFRICA. THERE ARE MANY GROUPS, MANY HUMAN GROUPS THAT DON'T FALL NEATLY IN TO ANY OF THESE CATEGORIES. AS WE SAMPLE THEM, AS WE LEARN MORE ABOUT THEM AS HUMAN MIGRATION CONTINUES TO INCREASE WE'LL FIND THAT THERE ARE A LOT OF GROUPS, LOT OF INDIVIDUALS THAT DON'T FALL IN TO ANY SPECIFIC GROUP. WE HAVE COMPLEX ANCESTORS. THEN I THINK EMPHASIZES THE FALLACY OF THINKING TYPE LOGICALLY, THIS IS ONE OF THE REASONS WHY I DON'T USE THE TERM "RACE" IN MY OWN PUBLICATIONS. BECAUSE I THINK IT TENDS TO ENCOURAGE THINKING ALONG THE LINES OF TYPES AND TWPOLOGIES WHEN IN FACT WE'VE SEEN MOST HUMAN GENETIC VARIATION IS SHARED AMONG POPULATIONS. LET ME ASK YOU A QUESTION. WAYNE JOSEPH, GREW UP IN LOUISIANA IN A CREOLE FAMILY. WHAT DO YOU THINK HIS ANCESTRY IS LOOKING AT HIS APPEARANCE? ANY GUESSES? FRENCH. YOU CAN TELL THIS IS A TRICK QUESTION, CAN'T YOU? WELL, HE WAS RAISED AS AN AFRICAN-AMERICAN. HE WAS HIGH SCHOOL PRINCIPAL IN CALIFORNIA, HE ISN'T HIS DNA OFF TO BE TESTED BY A COMPANY TO FIND OUT ABOUT HIS ANCESTRY. NOW, I SHOULD SAY THIS WE HAVE TO I TAKE THESE ANCESTRY ESTIMATES WITH A GRAIN OF SALT, A NUMBER OF ASSUMPTIONS INVOLVED BUT WHAT HE GOT BACK WAS THAT HE WAS 57% EUROPEAN, 39% NATIVE AMERICAN, 4% EAST ASIAN APPARENTLY NO AFRICAN GENES AT ALL. DESPITE HIS SELF REPORTED ANCESTRY. HE RETAINED HIS CULTURE, OF COURSE. BUT IT'S INTERESTING TO SEE HOW ONE SELF REPORTED ANCESTRY CAN DIFFER COMPLETELY FROM ONE DNA MEASURED ANCESTRY. AT LEAST AS ACCURATE AS THAT IS. THIS POINTS OUT VERY IMPORTANT DIFFERENCE. IN OUR DISCUSSION ABUT RACE THE DIFFERENT BETWEEN INDIVIDUAL ANCESTRY WHICH CAN BE VERY COMPLEX AND RACE WHICH IS VERY BLUNT TOOL. AN INDIVIDUAL WITH 90ERS AFRICAN ANCESTRY, 10% EUROPEAN ANCESTRY WOULD BE CONSIDERED AFRICAN AMERICAN IN THE UNITED STATES. ALSO AN INDIVIDUAL WITH ONLY SAY 30 OR 40% AFRICAN ANCESTRY IN THE UNITED STATES WOULD LIKELY SELF IDENTIFY AS AFRICAN AMERICAN. EVEN THOUGH THEIR GENETIC CONSTITUTION VERY DIFFERENT. THAT'S WHY I THINK IT'S SO IMPORTANT TO UNDERSTAND THIS DIFFERENCE BETWEEN INDIVIDUAL ANCESTRY AND WHAT WE REFER TO AS RACE. A FEW AGO I SENT MY DNA IN TO A COMPANY TO LEARN SOMETHING ABOUT MY OWN ANCESTRY. I WAS REALLY DISAPPOINTED, THIS IS ABOUT AS BORING A GENOME AS YOU CAN FIND, ACCORDING TO THE COMPANY, AT LEAST. MY ANCESTRY WAS 100% EUROPEAN, WITH NO INTERESTING VARIATION WHATSOEVER. I SHOULD SAY, ALL MY GRANDPARENTS CAME FROM NORWAY BUT I WAS HOPING THERE MIGHT SOMEBODY SORT OF ROGUE GENES IN THERE SOMEWHERE. YOU CAN COMPARE ME, THIS IS A WOMAN FROM THE BERBER POPULATION IN NORTH AFRICA. A MUCH MORE INTERESTING GENOME. HERE WE SEE MOSTLY EUROPEAN ANCESTRY, I THINK IT'S 86% BUT SOME SIGNIFICANT AFRICAN ANCESTRY, A LITTLE BIT OF ASIAN ANCESTRY AS ESTIMATED BY THE MARKERS USED BY THIS COMPANY. THE IMPORTANT POINT HERE, THOUGH, IS THAT SECTIONS OF THIS PERSON'S CHROMOSOME MIGHT BE OF EUROPEAN ORIGIN, THEY MIGHT BE OF AFRICAN ORIGIN. BIOMEDICALLY SIGNIFICANT GENES IN THIS PERSON MIGHT BE OF EUROPEAN ORIGIN, MIGHT BE OF AFRICAN ORIGIN. ONCE AGAIN ANCESTRY GIVES US A MUCH MORE COMPLEX VIEW OF THIS PERSON'S GENETIC LEGACY THAN DOES CATEGORY LIKE RACE. GOING BACK TO MY ANCESTRY, I ALSO DID GET A REPORT BACK ON MY Y CHROMOSOME, THIS IS Y CHROMOSOME THAT IS PRETTY COMMON IN NORTHERN EUROPEAN, NORTHERN EUROPE, I LEARNED THAT I SHARE IT WITH JIMMY BUFFET AND WARREN BUFFET. HASN'T DONE ANYTHING FUR MY SINGING OR INVESTING BUT IT WAS INTERESTING FACTOID. I DON'T HAVE GENGUS KHAN'S Y CHROMOSOME. I DON'T KNOW IF WE KNOW WHAT HIS REALLY WAS. THIS IS HISTORICAL FROLICKING I THINK. IT IS INTERESTING TO DO THIS SORT OF RECREATIONAL GENOMICS I ALSO GOT REPORT BACK ON MY MATERNAL HAPLO GROUP WHICH NOT SURPRISING IS ALSO COMMON IN EUROPE. WHAT DID THESE KINDS OF FINDINGS IMPLY FOR BIOMEDECIN? IF WE LOOK AT A LOT OF DNA POLYMORPHISMS WE CAN LEARN SOMETHING ABOUT ANCESTRY AND POPULATION 'TIS OTHER REWITH IMPORTANT QUALIFICATIONS THAT NOT EVERYONE FALLS IN TO GROUPS. AND THAT THERE ARE OF COURSE ASSUMPTIONS MADE IN MAKING THESE INFERENCES. RESPONSES TO THERAPEUTIC DRUGS MAY INVOLVE VARIATION IN FEW GENES AND ALSO GOING TO BE AFFECTED BY ENVIRONMENT. WHAT THAT MEANS IS THAT THOSE GENETIC CLASSIFICATIONS DON'T NECESSARILY TELL US THAT MUCH ABOUT BIOMED I CANNILY SIGNIFICANT PHENOTYPES. HERE IS A GREAT EXAMPLE. WE TALKED EARLIER ABOUT THE AFFECTS OF ACE INHIBITORS IN AFRICAN AMERICANS VERSUS EUROPEAN AMERICANS. THIS IS A METAANALYSIS PUBLISH ADD FEW YEARS AGO LOOKING AT THE DECREASE IN BLOOD PRESSURE THAT OCCURRED AFTER ADMINISTRATION OF ACE INHIBITORS IN THOUSANDS OF EUROPEAN HYPERTENSIVES AND AFRICAN AMERICAN HYPERTENSIVES WHAT WE SEE IS THAT ON AVERAGE THERE IS ABOUT A FIVE MILLIMETER DIFFERENCE IN RESPONSE, THAT IS AFRICAN AMERICANS ON AVERAGE DON'T RESPOND QUITE AS MUCH AS EUROPEAN AMERICANS. TO ACE INHIBITORS IN TERMS OF LOWERING THEIR BLOOD PRESSURE. BUT YOU CAN ALSO SEE THAT THERE'S A LOT OF OVERLAP BETWEEN THESE TWO CURVES. A LOT OF AFRICAN AMERICANS WOULD ACTUALLY RESPOND BETTER THAN A LOT OF EUROPEAN AMERICANS TO AN ACE INHIBITOR. ONCE AGAIN USING A CATEGORY LIKE RACE TO PREDICT RESPONSE GIVES US SOME INFORMATION BUT IT CAN ALSO MISLEAD US. HERE IS ANOTHER GOOD EXAMPLE. THE DRUG GEFITINIB, TREATS NON-SMALL CELL LUNG CANCER. IT'S AN EGFR INHIBITOR. IT IS EFFECTIVE AT LEAST FOR AWHILE IN IN ABOUT 10% OF EUROPEANS, ROUGHLY 30% OF ASIANS. SO YOU MIGHT BE TEMPTED TO THINK, WELL WE COULD USE POPULATION AFFILIATION TO HELP PREDICT WHO IS GOING TO RESPOND TO THIS DRUG. AFTER ALL THREEFOLD DIFFERENCE IN DIFFERENT POPULATIONS. SOMATIC MUTATIONS IN EGFR ARE SEEN IN ABOUT 10% OF EUROPEANS WHILE 30% OF JAPANESE PATIENTS, WHAT'S REALLY INTERESTING IS THAT, 80% OF THOSE WHO HAVE THESE EGFR MUTATIONS RESPOND TO GEFITINIB AND 10% WITHOUT THE MUTATION. WE CAN SEE WE GET MUCH BETTER PREDICTOR OF RESPONSE TO THE DRUG THAN IF WE LOOK AT POPULATION AFFILIATION. I'M GLAD TO SEE MY ANTI-VIRUS IS WORKING. THAT LEADS US TO A THEME THAT YOU'VE HEARD ABOUT AND HEAR MORE ABOUT IN THIS SERIES, TALK ABOUT PERSONALIZED MEDICINE THE NOTION THAT NOW THAT WE CAN LOOK AT INDIVIDUAL VARIATION THAT IS MUCH MORE APPROPRIATE TARGET AND MUCH MORE APPROPRIATE MEANS OF DECIDING THERAPY, ONCE WE HAVE THE INFORMATION, WITHIN WE CAN MAKE THE PREDICTION THAN USING BROAD CATEGORIES LIKE RACE OR POPULATION AFFILIATION. WHAT I HAVE TOLD YOU ABOUT WE DO SEE CORRELATION BETWEEN GEOGRAPHIC LOCATION AND GENETIC VARIATION BUT THAT VARIATION IF WE SAMPLE ENOUGH OF IT WE START TO SEE ESSENTIALLY CONTINUOUS NON-INTERRUPTED VARIATION ACROSS. I THINK THAT OUR TRADITIONAL CONCEPTS OF RACE MAY NOT BE ACTUALLY BIOLOGICALLY MEANINGLESS THAT MIGHT BE AN OVER STATEMENT. BUT IT'S BIOLOGICALLY VERY IMPRECISE, IT IS A BLUNT TOOL, CONCEPTS LIKE ANCESTRY LOOKED AT AT THE INDIVIDUAL LEVEL ARE CERTAINLY GOING TO BE MORE INFORMATIVE. WE HOPE THAT PERSONALIZED MEDICINE, WHEN IT BECOMES A REALITY, WILL BE MEDICALLY A LOT MORE USEFUL THAN CATEGORY LIKE ETHNICITIES OR RACE. I THINK FINALLY, THIS IS POINT THAT CAN'T BE EMPHASIZED ENOUGH, THERE IS NOTHING IN GENETICS THAT SUPPORTS RACIST THINKING, THINKING THAT WITHIN GROUP IS IN SOME WAY SUPERIOR TO ANOTHER. AND IN FACT I THINK A LOT OF EVIDENCE THAT CONTRADICTS THAT KIND OF THINKING BECAUSE WE CAN WITH GENETIC DATA ASCERTAIN HOW SIMILAR WE ALL FOR ONE ANOTHER. HOW MUCH VARIATION WE DO SHARE. I THINK ACTUALLY GENETICS IS AN IMPORTANT TOOL THAT CAN HELP TO COMBAT RACIST THINKING. WHAT I WOULD LIKE TO DO NOW BECAUSE I THINK 90 MINUTES IS TOO LONG FOR HUMANS TO SIT DOWN IN ONE PLACE I'M GOING TO SHOW YOU A NICE PRETTY PICTURE TAKEN FROM JUST A FEW MILES FROM MY HOUSE IN UTAH. I'M GOING TO ASK TO YOU STAND UP FOR ABOUT A MINUTE OR SO AND JUST STRETCH. WE'LL HAVE A LITTLE BREAK. WE'LL GO ON TO THE THIRD PART OF THE TALK. >> OKAY, I THINK WE'LL GET STARTED AGAIN. HOPE YOU ENJOYED YOUR BREAK. IN THE LAST PART OF MY TALK, WHAT I WOULD LIKE TO DISCUSS, YOU'LL BE DISCUSSING MORE OF THIS AS THIS SERIES PROCEEDS, IS HOW OUR UNDERSTANDING OF POPULATION GENETICS AND EVOLUTIONARY GENETICS HELPS US TO UNDERSTAND HAPLOTYPE DISTRIBUTIONS, THE CONTENTS AND HOW IT HELPS US TO DESIGN MORE EFFECTIVE GENE MAPPING STUDIES. THIS IS REALLY A BRIDGE BETWEEN POPULATION GENETICS AND EVOLUTIONARY GENETICS ON THE ONE HAND AND GENE MAPPING ON THE OTHER. THE TWO HAVE BECOME I THINK REALLY INTER-TWINED OVER THE LAST DECADE OR SO. SOMEBODY WITH INTEREST IN BOTH AREAS I'VE BEEN VERY GRATIFIED TO SEE MUTUAL INTEREST IN BOTH POPULATION GENETICS AND GENE MAPPING AND LOCATION. IF WE LOOK AT SNP FREQUENCIES ACROSS HUMAN POPULATIONS, WE FIND AT LEAST ROUGHLY THAT A SNP WITH A MINOR ALLELE FREQUENCY GREATER THAN 1% OCCURS ABOUT ONE EVERY 300 BASE PAIRS IN THE GENOME. THIS DEPENDS A LITTLE BIT ON POPULATION, BUT ROUGHLY WE CAN SAY THAT THERE ARE AT LEAST TEN MILLION SNP IN THE GENOME WHERE THE MINOR ALLELE FREQUENCY IS GREATER THAN 1%. THEY WOULD BE CONSIDERED UNDERATE ADDITIONAL DEFINITION POLYMORPHISM. A COMMON SINGLE NUCLEOTIDE POLYMORPHISM, THAT IS WITH THE MINOR ALLELE FREQUENCY GREATER THAN 5% WE HAVE ABOUT FIVE MILLION OR SO OF THOSE. AT LEAST ROUGHLY. THAT MEANS THAT AT EVEN RELATIVELY MODEST COST, LET'S SAY A TENTH OF A CENT PER SNP IF WE HAD -- WANTED TO GENOTYPE ALL FIVE MILLION OF THOSE VARIANTS, IT WOULD COST $5,000 PER PERSON. MAYBE TWO OR THREE THOUSAND BUT STILL A LOT OF MONEY PER SAMPLE. IF WE WANT TO DO CASE CONTROL ASSOCIATION STUDY COMPARING A THOUSAND CASES AND THOUSAND CONTROLS, THESE DAYS THIS IS PRETTY AFTERNOON MINIMUM SAMPLE SIZE, IT WOULD COST $15 MILLION TO GENOTIME ALL FIVE MILLION OF THOSE SNPS. SO, THIS WAS A REAL PROBLEM. DID WE REALLY NEED TO TEST ALL OF THESE SNP IN ORDER TO ASSESS VARIATION IN DOING A CASE CONTROL STUDY? WOULD THE SNP ASSOCIATION TEST REVEAL ANYTHING. I'LL START WITH COUPLE OF VERY SIMPLE DEFINITIONS BECAUSE I KNOW WE HAVE A DIVERSE AUDIENCE HERE. FIRST WE'LL DEFINE A HAPLOTYPE AS THE DNA SEQUENCE FOUND ON ONE MEMBER OF THE CHROMOSOME PAIR. IN THIS WE SEE TWO SETS OF ALLELES, SO FORTH. WE TRANSMIT THOSE HAPLOTYPES TO OUR OFFSPRING. NOW, AS YOU KNOW DURING MEIOSIS, CROSSOVERS CAN OCCUR BETWEEN HOMOLOGOUS PAIRS OF CHROMOSOMES LIKE THIS, RESULTING IN RECOMBINATION OF ALLELES, NOW THIS PARENT TRANSMITS THE NEW HAPLOTYPE. A HAPLOTYPE WITH A NEW COMBINATION OF ALLELES TO HIS OFFSPRING. THESE VERY FUND FUND MENTAL CROSS OVER ALLOW US TO ESTABLISH THE RELATIVE DIFFERENCES BETWEEN LOCI IN THE HUMAN GENOME. WE CAN ASK HOW OFTEN DO THEY OCCUR? OVER TIME THEN WE EXPECT MORE CROSSOVERS BETWEEN LOCI THAT ARE LOCATED FURTHER APART. A AND B ARE FURTHER APART. WE OBSERVE MORE CROSSOVERS BETWEEN THAT PAIR OF LOCI THAN BETWEEN B AND C. WHAT THAT MEANS IS THAT, AFTER MANY GENERATIONS WE'RE GOING TO FIND ALLELES B AND C, THAT IS BIG B AND C, TOGETHER ON THE SAME COPY OF THE CHROMOSOME MORE OFTEN THAN WE WILL FIND BIG A AND BIG B. BECAUSE THESE RECOMBINED TO GET LITTLE BIG A, WITH LITTLE B. AND EVENTUALLY WE TEND TO REASSORT THESE ALLELES. WHAT WE'RE SEEING IS THAT THERE IS MORE LINKAGE DISEQUILIBRIUM BETWEEN THESE PAIR OF LOCI THAN BETWEEN THIS PAIR OF LOCI. THIS PAIR OF LOCI ARE FOUND TOGETHER MORE FREQUENTLY THAN WE EXPECT BY CHANCE. THAT WHAT WE MEAN BY LINKAGE DISEQUILIBRIUM. HERE IS A LITTLE DIAGRAM, AGAIN ILLUSTRATING THE IDEA. THE LINKAGE, NONE RANDOM ASSOCIATION OF ALLELES AT LINKED LOCI. AT EQUILIBRIUM. WE WOULD EXPECT TO SEE EVERY POSSIBLE COMBINATION BIG A AND BIG B. LITTLE A AND LITTLE B AS WE LOOK AT COP PIECE OF CHROMOSOMES IN OUR POPULATION. WE CAN ASSESS THE FREQUENCIES OF THOSE ALLELES, BIG A AND LITTLE B. BY AN UNDER EQUILIBRIUM UNDER LINKAGE EQUILIBRIUM, YOU HAVE BIG A AND B TOGETHER THAT SHOULD EQUAL THE FREQUENCIES OF THOSE ALLELES IN THE POPULATION. THAT IS 60% "TIMES" 40%. AND 06% TIMES 30% OR 18% OF THE TIME. SO FORTH. IN OTHER WORDS, THESE LOCI, THEIR ALLELES ARE INDEPENDENT OF EACH OTHER THEY'RE AT EQUILIBRIUM. WE WE CAN MULTIPLY THEIR RESPECTIVE FREQUENCIES TO GET HAPLOTYPE FREQUENCY F. WE SEE A SUBSTANTIAL DEVIATION FROM THAT AS WE SEE IN THIS DIAGRAM WHERE BIG A AND BIG B ARE FOUND TOGETHER ON THE SAME COPY OF A CHROMOSOME ON THE SAME HAPLOTYPE MORE FREQUENTLY THAN WE EXPECT BY CHANCE, BIG A -- LITTLE A AND B ARE ALSO FOUND MORE FREQUENTLY THAN WE EXPECT BY CHANCE: THEN WE HAVE LINKAGE DISEQUILIBRIUM. THERE HAVEN'T BEEN ENOUGH RECOMBINATIONS TO RANDOMLY REASSORT THESE ALLELES IN OUR POPULATION. WHAT THAT SUGGESTS IS THAT A AND B ARE LIKELY FAIRLY CLOSE TOGETHER. IMAGE HOW LINKAGE DISEQUILIBRIUM WOULD ARISE. QUICK EQUILIBRIUM QUICK FOR A FEW GENERATIONS EVERY TIME WE SEE THE CF MUTATION WE'RE GOING TO SEE THESE ALLELES NEARBY. THESE ALLELES, ALTERNATIVE SHOWN HERE ARE NOT ASSOCIATED WITH THE MUTATION. THEY'RE GOING TO BREAK UP THE ASSOCIATIONS. ON THIS CHROMOSOME COPY IN THE PRESENT DAY POPULATION, WE SEE OUR CF MUTATION CO-OCCURRING NOT WITH BIG B BUT WITH LITTLE D. BUT WE STILL TEND TO SEE ASSOCIATIONS BETWEEN OUR MUTATIONS AND SNPS THAT ARE VERY NEARBY. SO EVACUEE STILL SEE UPPER CASE G -- WE -- WHEN WE SEE OUR DISEASE CAUSING MUTATION BECAUSE IT'S VERY CLOSE. SO CLOSE THAT REBY MAKES STILL HASN'T HAD TIME -- RECOMBINATION TO REASSORT OUR MUTATION WITH THE ALTERNATIVE ALLELE, LITTLE G. THIS IS WHAT WE MEAN BY LINKAGE DISEQUILIBRIUM TO PINPOINT THE CYSTIC FIBROSIS GENE BACK IN THE LATE 1980S. IN FAMILIES WITH LINKAGE DISEQUILIBRIUM THIS IS SOMETHING THAT WE CAN ESTIMATE IN POPULATION. WE HAVE OF COURSE MICROARRAY TECHNOLOGY NOW THAT ALLOWS US TO LOOK AT VERY DENSE ARRAYS OF SNPS TO DO OUR GENOTYPING. THE REAL ADVANTAGE OF LINKAGE DISEQUILIBRIUM IS THAT IT IN ESSENCE INCORPORATES MANY PAST GENERATIONS OF RECOMBINATION. EVERY RECOMBINATION THAT HAS OCCURRED SINCE THE MUTATION TOOK PLACE. I'D LIKE TO SHOW THIS IS AS CONTRAST. HERE IS A THREE GENERATION OF FAMILIES, COUNSELED RECOMBINATIONS TO ESTIMATE THE DISTANCE BETWEEN LOCI. WE'RE LIMITED TO THE NUMBER OF GENERATIONS THAT WE CAN ACTUALLY COLLECT. BUT WITH LINKAGE DISEQUILIBRIUM WHAT WE'RE EFFECTIVELY DOING IS GOES GO BACK TO THE COMMON ANCESTOR IN WHOM THIS DISEASE FIRST OCCURRED. OUR HOME IN THESE FAMILIES THEY ALL SHARE THE SAME DISEASE CAUSING MUTATION DESCENDED FROM THE COMMON ANCESTOR WITH LINKAGE DISEQUILIBRIUM WE'RE INCORPORATING THE EFFECTS OF THE RECOMBINATIONS OCCURRED OVER MANY GENERATIONS. SINCE THE MUTATION FIRST OCCURRED. WHAT THAT MEANS IS THAT LINKAGE DISEQUILIBRIUM CAN ALLOW US TO MORE FINALLY LOCALIZE THE DISEASE CAUSING MUTATION BECAUSE WE HAVE LOT MORE RECOMBINATIONS EFFECTIVELY TO LOOK AT. BECAUSE POPULATIONS WITH REGARD TO ANY SPECIFIC MUTATION ARE EE ISN'T SYSTEMLY ONE BIG PEDIGREE. ONE BY KATE PEDIGREE CHASING BACK -- THIS LINKAGE DISEQUILIBRIUM HAS INTERESTING HISTORY, WHEN I FIRST BECAME INTERESTED IN IT, I HAVE TO ADMIT IT WAS A LONG TIME AGO BACK ABOUT 1982 A GUY NAMED DAVID BARKER WHO WAS POST DOC OF RAY WHITE CAME TO MY OFFICE HAD FOUR BRAND NEW RFLPS. BACK IN THOSE DAYS FOUR NEW POLYMORPHISMS WAS A BIG DEAL HE GOT THE LEAD ARTICLE IN THE AMERICAN JOURNAL OF HUMAN GENETICS THAT ISSUE. WE WERE LOOKING AT LINKING A DISEQUILIBRIUM PATTERNS THAT'S WHEN I GOT INTERESTED IN THAT PHENOMENON OF LINKAGE DISEQUILIBRIUM. I HAS INTERESTING PROPERTIES IN POPULATION GENETICS. BUT IF YOU LOOK BACK AT THAT TIME, ONLY ABOUT 20 ARTICLES PER YEAR WERE PUBLISHED ON LINKAGE DISEQUILIBRIUM. COULD READ A PAPER KNOW EVERYTHING THERE WAS ABUT THIS TOPIC. NOW WE'RE CLOSE TO 2,000 PAPERS. YOU'D HAVE TO READ 30 OR 40 PAPERS A WEEK TO KEEP UP THAT WITH LITERATURE. NOT THAT ALL ARE WORTH READING. THIS INDICATES HOW MUCH INTEREST THERE IS IS IN THE TOPIC NOW OF LINKAGE DISEQUILIBRIUM. RELATIVE TO SAY 25 YEARS AGO. THE QUESTION IS, IS THERE A SIMPLE, UNIFORM RELATIONSHIP BETWEEN INTER-LOCUS PHYSICAL DISTANCE IN THE GENOME AND LINKAGE DISEQUILIBRIUM. IN OTHER WORDSF WE KNOW THE AMOUNT OF LINKAGE DISEQUILIBRIUM, THE AMOUNT OF NONRANDOM ASSOCIATION BETWEEN TWO LOCI, HOW WELL CAN WE PREDICT HOW FAR APART THEY ACTUALLY ARE. THIS IS THE RELATIONSHIP THAT WE WOULD EXPECT THAT HAS DISTANCE BETWEEN LOCI INCREASES, LINKAGE DISEQUILIBRIUM DECREASES. IT GOES EVENTUALLY TO ZERO. IT'S OFTEN MEASURED WITH R, CORRELATION COEFFICIENT SO THAT IN COMPLETE DISEQUILIBRIUM, THAT IS WHEN WE CAN PERFECTLY PREDICT THE ALLELE AT ONE LOCUS IF WE KNOW THE ALLELE STATUS AT NEARBY LOCUS THEN RAILROAD IS EQUAL TO ONE. IF THERE'S NO RELATIONSHIP UNDER EQUILIBRIUM R IS EQUAL TO ZERO NO CORRELATION. THIS IS THE RELATIONSHIP THAT WE EXPECT. SOME YEARS AGO WE LOOKED AT A NUMBER OF RFLPS NEAR THE GENE. WE WERE INTERESTED IN THE DISTANCE, EACH OF THESE POINTS REPRESENTS A PAIR OF POLYMORPHISMS AND THE QUESTION WAS, DOES DISEQUILIBRIUM BETWEEN PAIRS OF POLYMORPHISMS DECREASE AS WE LOOK AT POLYMORPHISMS THAT ARE FURTHER APART. HERE WE LOOKED ACROSS ABOUT 600KB. INDEED WE DID FIND SIGNIFICANT NEGATIVE RELATIONSHIP BETWEEN DISEQUILIBRIUM AND DISTANCE BETWEEN THESE POLYMORPHIC MARKERS. THAT WAS EARLY INDICATION THAT LINKAGE DISEQUILIBRIUM POTENTIALLY COULD BE USED TO FINALLY LOCALIZE GENES ON CHROMOSOMES. HERE IS ANOTHER EXAMPLE OF THIS PLOT WHICH IVORY DUESED TO SMALL SIZE HAS SERIES OF POINTS HERE, THEN SERIES OF POINTS THAT DOESN'T CORRESPOND TO OUR RELATIONSHIP AT ALL. IT'S NOT THE DECREASE, MONOTONIC DECREASE IN DISEQUILIBRIUM WITH DISTANCE THAT WE WOULD EXPECT. THIS WAS AN ANALYSIS WE DID IN THE NEUROFIBROMITOSIS TYPE ONE REGION. WHAT WE FOUND FOR ALL OF THESE MARKERS, ALL PAIRS WERE IN SUBSTANTIAL DISEQUILIBRIUM WITH AN R FALL VIEW GREATER THAN .82 BUT THERE WAS ANOTHER ONE JUST 68KB AWAY FROM ITS NEAREST NEIGHBOR WHERE THERE WAS NO DISEQUILIBRIUM BETWEEN IT AND ALL OF THE OTHER MARKERS. WE HAD LOT OF HYPOTHESES AT THE TIME TO ACCOUNT FOR THIS. IT WAS A GC-RICH REGION, RECOMBINATION IS SOMEWHAT ASSOCIATED WITH GC CONTENT. BUT WHAT ULTIMATELY WE LEARNED FROM THE HAPMAP DATA IS THAT THERE IS A RECOMBINATION HOT SPOT RIGHT IN THIS AREA. SO THE RECOMBINATION HOT SPOT LOCATED RIGHT HERE EXPLAINS WHY THIS POLYMORPHISM IS NOT IN EQUILIBRIUM -- NOT IN DISEQUILIBRIUM, EVEN WITH ONE THAT IS ONLY 70KB AWAY. WE DON'T ALWAYS SEE THE UNIFORM RELATIONSHIP THAT WE MIGHT EXPECT BETWEEN DISTANCE, THAT IS PHYSICAL DISTANCE BETWEEN POLYMORPHISMS AND LINKAGE DISEQUILIBRIUM BETWEEN THEM. BECAUSE AMONG OTHER THINGS WE HAVE HOT SPOTS THROUGHOUT THE HUMAN GENOME. THERE ARE NUMBER OF FACTORS THAT CAN AFFECT LINKAGE DISEQUILIBRIUM PATTERNS. CHROMOSOME LOCATION, RECOMBINATION MORE COMMON WITH TELOMERES THAN ELSEWHERE. DNA SEQUENCE PATTERNS, GC CONTENT WE FOUND THAT ALU ELEMENTS INFLUENCE RECOMBINATION AND INCREASE RECOMBINATION BY A FEW PERCENT. THERE ARE A LOT OF ALU ELEMENTS THROUGHOUT THE GENOME. THERE ARE RECOMBINATION HOT SPOTS EVERY 50 TO 100KB, WHAT I THINK IS ESPECIALLY INTERESTING IS THAT EVOLUTIONARY FACTORS INFLUENCE PATTERNS OF LINKAGE DISEQUILIBRIUM IN THE HUMAN GENOME. THINGS LIKE NATURAL SELECTION, GENE FLOW, GENE CONVERSION, GENETIC DRIFT. ALL OF THE FACTORS THAT PEOPLE LIKE TO THINK ABOUT INFLUENCE PATTERNS OF LINKAGE DISEQUILIBRIUM BECAUSE DISEQUILIBRIUM REFLECTS THE HISTORIES OF POPULATIONS. THE FACTORS IN POPULATIONS THAT HAVE AFFECTED GENETIC VARIATION. THERE ARE SOME INTERESTING IMPLICATIONS OF OUR POPULATION GENETIC STUDIES FOR DISEQUILIBRIUM PATTERNS. WE'VE SEEN THAT THERE IS CONDITION VARIATION, EVEN VARIATION WITHIN MAJOR CONTINENTS THAT IS GOING TO AFFECT STRATIFICATION PATTERNS SHOULD BE TAKEN IN TO ACCOUNT WHEN WE'RE DESIGNING CASE CONTROL ASSOCIATION STUDIES. THE FACT THAT THE AFRICAN POPULATIONS WERE FOUNDED FURTHER AGO IN TIME, IN OTHER WORDS, THEY HAVE -- WE CAN SAY A GREATER AGE IMPLIES THAT WE SHOULD SEE LESS LINKAGE DISEQUILIBRIUM IN THOSE POPULATIONS. THAT IS THERE HAS BEEN MORE TIME FOR RECOMBINATIONS TO OCCUR IN THOSE POPULATIONS, WE'RE GOING TO SEE LINKAGE DISEQUILIBRIUM PERSISTING OVER SHORTER DISTANCES BECAUSE OF MORE RECOMBINATIONS. WE'VE SEEN GREATER DIVERGENCE OF AFRICAN POPULATIONS, WHAT THAT IMPLIES IS THAT WHAT WE SOMETIMES CALLED ADMIXTURE LINKAGE DISEQUILIBRIUM, REFLECTS MIX TOURS OF AFRICAN AND NON-AFRICAN POPULATIONS WE DON'T HAVE TIME HERE TO TALK ABOUT AD MIXTURE DISEQUILIBRIUM, BUT IT'S STARTING TO BE APPLIED WITH SOME LEVEL OF SUCCESS. HERE IS WAY IN WHICH POPULATION GENETICISTS ARE UNDERSTANDING HAPPEN LAY TYPE STRUCTURE AND GENE MAPPING. IF WE THINK OF POPULATIONS THAT WERE FOUNDED A LONG TIME AGO THERE HAVE BEEN MANY GENERATIONS OR RECOMBINATIONS TO OCCUR AS WE SEE HERE. THAT MEANS THAT WE'RE GOING TO SEE RELATIVELY SHORT GROUPS OF LAP LOW TYPES, OR HAPLOTYPE BLOCK, IS THAT THEY'RE CALLED. A POPULATION THAT WAS FOUNDED RELATIVELY RECENTLY EXAMPLE MIGHT BE POPULATION OF FINLAND, OF MOST OF WHICH WAS FOUNDED JUST COUPLE THOUSAND YEARS AGO. THERE HASN'T BEEN AS MUCH TIME TO RECOMBINATIONS TO OCCUR IN THOSE -- IN POPULATION LIKE THAT. SO WE HAVE FEWER HAPLOTYPES IN LARGER BLOCKS. THERE'S MORE DISEQUILIBRIUM, LESS HAPLOTYPE DIVERSITY. SO IF WE THINK ABOUT MUTATION, THAT MAY HAVE OCCURRED IN THAT POPULATION, COUPLE THOUSAND YEARS AGO, IT'S GOING TO BE IN DISEQUILIBRIUM WITH A LARGE NUMBER OF SNPS. HASN'T BEEN MUCH TIME FOR RECOMBINATION TO CAUSE THOSE TO DECAY. IN CONTRAST MUTATION THAT OCCURRED IN AN AFRICAN POPULATION WILL HAVE HAD MORE TIME TO DURING WHICH CAN REDUCE ASSOCIATION WITH NEARBY SNPS. WE FIND THAT MUTATION IN ASSOCIATION WITH SMALLER NUMBER OF SNPS. IN OTHER WORDS WE'RE GOING TO NEED MORE SNPS IN THIS POPULATION TO FIND ASSOCIATION THAN IN THIS POPULATION. CONVERSELY IN THIS POPULATION, WE CAN MORE FINALLY MAP THE LOCATION OF A MUTATION BECAUSE IT'S IN ASSOCIATION WITH JUST A FEW NEARBY SNPS. SOME IMPORTANT ATTRIBUTES OF HISTORY THAT HELP TO INFORM US ABOUT THE DESIGN OF ASSOCIATION STUDIES. IF WE LOOK AT SOME REAL DATA, THIS IS KIND OF DISPLAY THAT WE GET FROM THE PROGRAM HAPLOVIEW. LET ME EXPLAIN WHAT WE'RE LOOKING AT HERE BECAUSE WE SEE THESE ALL THE TIME IN THE ASSOCIATION STUDY LITERATURE. THIS IS A MAP OF LINKAGE DISEQUILIBRIUM. EACH OF THESE LITTLE COLUMNS HERE REPRESENTS A SNP. THEY ARE ARRAYED ACCORDING TO THEIR PHYSICAL LOCATION ACROSS THE CHROMOSOME, AND THEN EACH OF THESE SQUARES LIKE THIS RED SQUARE HERE, INDICATES THE LINKAGE DISEQUILIBRIUM BETWEEN A PAIR OF SNPS. FOR THIS ADJACENT PAIR OF SNPS RIGHT HERE, WE HAVE RED, WE HAVE HIGH DISEQUILIBRIUM, OR THIS PAIR, THAT IS THIS ONE AND THIS ONE, WE HAVE LITTLE DISEQUILIBRIUM. AN ANALOGY WOULD BE THE MILEAGE CHARTS THAT SOME OF US HAVE USED WHERE WE CAN TAKE ANY PAIR OF CITIES, LET'S SAY NEW YORK AND SAN FRANCISCO AND WE CAN SAY, WHAT'S THE DISTANCE BETWEEN THEM. FOR A PAIR OF SNPS WHAT IS THE DISEQUILIBRIUM. THERE'S A LOT MORE DISEQUILIBRIUM IN THIS EURASIAN SAMPLE THAN THE AFRICAN SAMPLE CONSISTENT WE SEE SNIPS OCCURRING IN MUCH LARGER -- SNPS IN HAPLOTYPE BLOCKS IN THESE POPULATIONS THAN IN THESE. THAT HAS IMPORTANT IMPLICATIONS FOR STUDY DESIGN. I'VE SHOWED YOU EXAMPLES, NOW GENERAL ARE THESE PATTERNS? IF WE LOOK ACROSS THE GENOME, WHAT KINDS OF PATTERNS DO WE SEE? BACK ABOUT TEN YEARS AGO OUR KNOWLEDGE OF THE HUMAN GENOME, OF LINKAGE DISEQUILIBRIUM ACROSS THE HUMAN GENOME WAS LOT LIKE THIS MAP OF THE WORLD FROM 1544. WE REALLY DIDN'T KNOW MUCH ABOUT PATTERNS OF DISEQUILIBRIUM OR HAPLOTYPE STRUCTURE ACROSS THE GENOME. AND IF YOU LOOK AT THIS MAP WE HAVE FAIRLY GOOD REPRESENTATION OF EUROPE, SOME OF AFRICA AND ASIA, YOU SEE NORTH AMERICA IS COMPLETELY ABSENT IN 1544. THAT'S HOW OUR KNOWLEDGE OF HAPLOTYPE STRUCTURE ACROSS THE GENOME WAS ROUGHLY 10 OR 12 YEARS AGO. THAT'S WHAT LED TO THE HAPMAP PROJECT. I WANT TO MENTION THIS YOU'LL HEAR MORE ABOUT IT OTHER LECTURES AS WELL. THE ORIGINAL IDEA WAS TO LOOK AT LARGE COLLECTION OF SNPS, 6700,000 EVENTUALLY WENT TO A MILLION THEN MORE AFTER THAT. IN INDIVIDUALS FROM THREE MAJOR POPULATIONS. 906 THEM IN 30TRIOSER FROM THE UTAH CEPH COLLECTION. THIS REPRESENTED NORTHERN EUROPE. 90 YORUBAN AND 90 EAST ASIAN INDIVIDUALS. LOOK AT PATTERN OF LINKAGE DISEQUILIBRIUM IN THESE DIFFERENT POPULATIONS AND TO LOOK AT HAPLOTYPE STRUCTURE TO SEE TO WHAT EXTENT THESE VARIED AMONG POPULATIONS AND ACROSS THE GENOME. THERE WERE SOME INTERESTING ISSUES THAT CAME UP IN THE EARLY DISCUSSIONS OF THE HAPMAP. I WAS LUCKY ENOUGH TO BE PART OF THOSE DISCUSSIONS, ONE OF THE ISSUES WAS HOW BEST TO SAMPLE HUMAN DIVERSITY IF YOU CAN ONLY SAMPLE FEW POPULATION. DECISION WAS TO TRY TO LOOK AT FAIRLY BROAD SAMPLING BUT BY NO MEANS COMPLETE SAMPLING OF HUMAN DIVERSITY. OF COURSE SAMPLE SIZE ISSUES, ISSUES INVOLVING SNP ASCERTAINMENT AND DENSITY THEN ALSO NUMBER OF ETHICAL, LEGAL, SOCIAL ISSUES, THINGS LIKE INFORMED CONSENT. EVEN SOME DISCUSSION OF WHETHER WE SHOULD NAME THE POPULATIONS OR NOT OR WHETHER THE THREE POPULATIONS SHOULD NOT BE IDENTIFIED BECAUSE OF CONCERNS ABOUT POTENTIAL STIGMATIZATION. THE POPULATION GENETICISTS FELT BECAUSE WE KNOW POPULATION HISTORY AFFECTS HOP LOW TYPE STRUCTURE AND DISGIBB WE DON'T NAME THE -- WE DON'T KNOW THEIR HISTORY THAT WOULD BE SEVERE LIABILITY. FOR THAT REASON WE DECIDED TO NAME THE POPULATION AND I THINK THAT IS ADDED A LOT OF USEFULNESS IN INFORMATION TO THE HAPMAP SAMPLES. SUBSEQUENT TO THAT I THINK OUR MAP OF THE WORLD IMPROVED. YOU CAN SEE CALIFORNIA FOR SOME REASON STILL MISSING FROM THIS MAP. BUT BY AND LARGE OUR KNOWLEDGE OF THE HUMAN GENOME IMPROVED -- OUR KNOWLEDGE OF DISEQUILIBRIUM IN THE GENOME IMPROVED A GREAT DEAL. THERE HAVE BEEN A NUMBER OF INTERESTING APPLICATIONS OF THE HAPMAP. FIRST OF ALL UNDERSTANDING WORLDWIDE GENOME WIDE PATTERNS OF HAPLOTYPE DIVERSITY, DETECTING RECOMBINATION HOT SPOTS THROUGHOUT THE GENOME, DETECTION OF GENES THAT HAVE EXPERIENCED NATURAL SELECTION THEN OF COURSE DETECTION OF DISEASE CAUSING MUTATIONS. HERE IS AN EXAMPLE LOOKING AT THE DECAY OF DISEQUILIBRIUM ACROSS GENOMIC REGIONS, IN THE HAPMAP POPULATION, THE ASIAN, EUROPEAN, AFRICAN POPULATIONS, YOU CAN SEE THAT AS WE WOULD EXPECT WITH MORE RECOMBINATIONS DISEQUILIBRIUM DECAYS MORE QUICKLY WITH PHYSICAL DISTANCE IN THE AFRICAN POPULATION. IN THE YOURURBA,N AND MORE RECENTLY FOUNDED WE SEE THAT DISEQUILIBRIUM DOESN'T DECAY QUITE AS RAPIDLY. AGAIN MORE RECENT HISTORY, FEWER RECOMBINATIONS, MORE LINKAGE DISEQUILIBRIUM. WOULD GET PICTURES WITH THESE DATA. ONE OF THE REALLY IMPORTANT CONSEQUENCES OF HAPMAP IS THAT WE'VE LEARNED THAT BECAUSE OF THE PATTERN OF DISEQUILIBRIUM ACROSS THE GENOME A LOT OF SNPS ARE EFFECTIVELY REDUNDANT. IF WE KNOW THAT THIS PERSON HAS A C AT THIS POSITION, THEY HAVE T AT ANOTHER POSITION, AN A AT ANOTHER POSITION, BECAUSE OF LINKAGE DISEQUILIBRIUM. WHERE AS PERSON B HERE HAS AN A AT THIS POSITION, G HERE, AND A C HERE, WHAT THAT TELLS US IS THAT WE ONLY NEED TO TYPE THIS ONE IN ORDER TO KNOW THE GENOTYPES OF THESE. IN OTHER WORDS, WE DON'T HAVE TO TYPE ALL FIVE MILLION COMMON SNPS. WE CAN TYPE A SUBSET OF THEM WHAT WE CALL TAGGING SNPS AND GET PRETTY GOOD PICTURE OF THE DIVERSITY ACROSS THE GENOME BY LOOKING AT THAT SUBSET OF VARIATION IS THAT IN ITSELF IS A HUGE SAVING OF MONEY. THE TACT THAT WE CAN TYPE MAYBE A MILLION SNPS IN NON-AFRICAN POPULATIONS AND ESSENTIALLY GET THE HAPLOTYPE DIVERSITY ACROSS THE GENOME INSTEAD OF TYPING ALL FIVE MILLION IS A HUGE SAVINGS. WE FIND NUMBER OF STUDIES LOOKED AT THE PORTABILITY OF THE HAPMAP TAGGING SNPS ACROSS POPULATIONS. WE FIND THAT IN GENERAL THEY ARE PRETTY PORTABLE. THAT IS YOU CAN INFER PATTERNS OF DISEQUILIBRIUM FROM ONE MAJOR POPULATION WITHIN A CONTINENT TO ANOTHER AND MOST OF THE TIME GET IT RIGHT. AND FINDING HOT SPOTS WHERE DISEQUILIBRIUM SUDDENLY DECLINES. WE DEFINE A REBY NATION HOT SPOT WHERE RECOMBINATION IS ELEVATED TENFOLD. IT'S BEEN QUITE INTERESTING TO DISCOVER THAT THERE ARE TENS OF THOUSANDS OF HOT SPOTS IN THE HUMAN GENOME. ROUGHLY ONE EVERY 50 TO 100KB. WE RECENTLY LOOKED AT FAMILY, TWO PARENTS TWO, OFFSPRING LOOKED THEIR WHOLE GENOME SEQUENCE FOUND 155 RECOMBINATIONS, 92 OF THEM WERE IN RECOMBINATION HOT SPOTS. IN GENERAL THE DATA TELLS US THAT MOST CROSSOVERS AT LEAST 60% OCCUR IN ONLY ABOUT 10% OF THE GENOME. HOT SPOTS REALLY ARE SIGNIFICANT IN TERMS OF ACCOUNTING FOR MOST CROSSOVERS. ANOTHER REALLY INTERESTING FINDING FROM THESE STUDIES OF RECOMBINATION IS THAT THEY'RE NOT AT ALL CONGRUENT IN HUMAN AND CHIMP. EVEN THOUGH OUR DNA SEQUENCE IS 99% THE SAME, OUR HOT SPOTS ARE VERY, VERY DIFFERENT. SUGGESTING THAT THESE EVOLVE VERY RAPIDLY, THEY MAY NOT BE SEQUENCE DEPENDENT THEY MAY INVOLVE EPIGENETIC MECHANISMS. SO ALL KINDS OF INTERESTING QUESTIONS THAT CAN BE ADDRESSED WITH DATA SUCH AS THOSE OF THE HAPMAP. , WE CAN DETECT NATURAL SELECTION IN THE GENOME. THIS SLIDE SKETCHES OUT HOW WE DO THAT. IMAGINE THAT A VARIANT OF THE INTEREST HAS OCCURRED, AND AS WE SAW IT OCCURS ON A SPECIFIC CHROMOSOME BACKGROUND SO WE WILL SEE IT AT FIRST IN ASSOCIATION WITH NEARBY SNPS. BUT OF COURSE WHEN IT FIRST OCCURS IT'S ALLELE FREQUENCY OVER HERE IS VERY LOW. IF IT'S A NEUTRAL VARIANT IT MAY RISE IN FREQUENCY THROUGH TIME AS A RESULT OF GENETIC DRIFT. BUT THAT INCREASES IN FREQUENCY IS GOING TO BE VERY SLOW. WHAT HAPPENS AS THIS VARIANT, THE RED STAR, INCREASES IN FREQUENCY THROUGH TIME IS THAT BECAUSE OF RECOMBINATION IT IS ASSOCIATED WITH FEWER AND FEWER NEARBY SNPS THROUGH TIME. WE'RE GOING TO SEE VERY LITTLE DISEQUILIBRIUM BETWEEN THIS VARIANT AND LET'S SAY THIS SNP, ONCE THE VARIANT GETS TO LET'S SAY 10% IN FREQUENCY. BUT IF IT'S UNDER SELECTION, IF THERE HAS BEEN RECENT POSITIVE SELECTION FOR THAT VARIANT, LET'S SAY CONFER SOME SORT OF ADAPTIVE ADVANTAGE, IT WILL RISE QUICKLY TO HIGH FREQUENCY, LET'S SAY 10 OR 12%. AND BECAUSE OF SELECTION, BECAUSE SELECTION HAS CAUSED IT TO RISE IN FREQUENCY VERY QUICKLY, IT WILL STILL BE IN DISEQUILIBRIUM WITH MANY NEARBY SNPS. WE'LL HAVE VERY LONG RANGE OF LINKAGE DISEQUILIBRIUM AROUND THAT VARIANT. WE'LL SEE THAT WHEN WE LOOK AT GENOMIC DATA WE'LL SEE A REGION IN WHICH THERE IS HIGH DISEQUILIBRIUM OVER UNEXPECTEDLY LARGE DISTANCE. THIS IS A SIGNATURE OF STRONG POSITIVE SELECTION ON THIS VARIANT. THAT IS THAT THE VARIANT HAS RISEN TO HIGH FREQUENCY AND IS IN LARGE LINKAGE DISEQUILIBRIUM BLOCK. THIS APPROACH HAS BEEN USED TO DETECT NATURAL SELECTION INVOLVING A NUMBER OF PHENOTYPES. MALARIA RESISTANT, HEMOCHROMATOSIS, LACK TAKES PERSISTENT, SKIN PIGMENTATION, SO FORTH. ANOTHER INTERESTING APPLICATION OF THESE DATA BINDING REGIONS THAT HAVE BEEN STRONGLY AFFECT BY POSITIVE SELECTION IN HUMAN POPULATION. HAS HAD SOME REAL SUCCESSES IN LOCALIZING SINGLE GENE DISORDERS. THAT IS WHERE THEY WERE LOCI WERE FIRST MAPPED ROUGHLY BY USING LINKAGE ANALYSIS THEN DISEASE CAUSING GENE WAS FOUND USING LINKAGE DISEQUILIBRIUM ANALYSIS. TO PINPOINT THE ACTION GENE. MY DISPLAY HAS JUST FROZEN. THESE KINDS OF STUDIES ARE VERY SUCCESSFUL IF WE -- IF MOST CASES OF THE DISEASE ARE CAUSED BY SINGLE MUTATION. THAT MAKES SENSE. IF THIS MUTATION IS ONLY ONE OR THE PRINCIPLE ONE THAT CAUSES DISEASE, THEN WE'RE GOING TO BE ABLE TO EASILY DETECT 'SOAKS S QUAKES EVER DISEASE PHENOTYPE AND NEARBY SNPS. IMAGINE IF WE HAVE MULTIPLE DISEASE CAUSING MUTATIONS. THEN SOMETIMES WE'RE GOING TO SEE THE DISEASE WHEN WE SEE ONE GENOTYPE OR OTHER, WHEN THERE ARE MULTIPLE DISEASE CAUSING MUTATIONS, WHEN THERE IS SUBSTANTIAL ALLELIC HETEROGENEITY THAT PRESENTS REAL CHALLENGES IN DOING CASE CONTROL ASSOCIATION STUDIES. SO, ONE OF OUR ISSUES IS HOW CAN WE REDUCE THAT HETEROGENEITY AND ENHANCE THE GENETIC SIGNAL. WELL, CLEARLY, CONSISTENT TRAIT DEFINITION, USE OF INTERMEDIATE PHENOTYPES WHO HELP TO DECREASE HETEROGENEITY. WE CAN IDENTIFY SUBTYPES, THOSE WITH EARLY ONSET, SEVERE EXPRESSION, THIS IS WHERE CLINICIANS CAN BE ESPECIALLY HELPFUL BECAUSE TYPICALLY CLINICIANS UNDERSTAND THOSE SUBTYPES AND CAN INFORM GENETICIST AS TO WHICH GROUP OF CASES SHOULD USED IN AN OWE SAYS QUAKES STUDY. USE OUR KNOWLEDGE OF EVOLUTIONARY HISTORY. TO DEFINE POPULATION IN VERY STRICT AND NARROW FASHION SO THAT WE HAVE AS UNIFORM AN EVOLUTIONARY HISTORY AS POSSIBLE. THERE MAY BE SITUATIONS IN WHICH POPULATION ISLETS WILL BE SPECIAL UTILITY. SO THE BOTTOM LINE IS SOME OF YOU HAVE SEEN THESE DISPLAYS BEFORE WE CAN NOW POINT TO QUITE A FEW GENOME WIDE ASSOCIATION STUDIES THAT HAVE BEEN SUCCESSFUL IN UNCOVERING VARIANTS FOR COMMON COMPLEX DISEASE. THERE'S STILL A LOT LEFT TO BE DISCOVERED. BECAUSE I THINK INTELLIGENT STUDY DESIGN, MUCH OF IT INFORMED BY OUR KNOWLEDGE OF POPULATION GENETICS, THESE KINDS OF STUDIES HAVE BEEN MUCH, MUCH MORE SUCCESSFUL OVER THE LAST COUPLE OF YEARS THAN PREVIOUSLY. THIS IS SOMETHING ELSE THAT YOU WILL HEAR ABOUT, I THINK KAREN MULKEY WILL TALK ABOUT THIS LATER IN THIS SERIES. SO, TO SUMMARIZE, WHAT I'VE TOLD YOU THIS MORNING, WE SEE THAT GENETIC VARIATION, WE LOOK AT SNP MICROARRAYS ARE LOOKING AT WHOLE GENOME SEQUENCE, IT DOES CONTAIN USEFUL INFORMATION ABOUT OUR POPULATION HISTORY, OUR ANCESTRY. I THINK THAT OUR STUDIES OF GENETIC VARIATION GIVES US MORE INFORMED, MORE NUANCED VIEW OF THE CONCEPT OF RACE AND TELL US MUCH MORE ABOUT MEDICAL RELEVANCE THAN IF WE USE THESE BROAD CATEGORIES LIKE POPULATION AFFILIATION OR RACE, AND POPULATION GENETIC ANALYSIS ESPECIALLY IN THE CONTEXT OF LINKING A DISEQUILIBRIUM HAS PLAYED A CENTRAL ROLE IN UNDERSTANDING LINKAGE DISEQUILIBRIUM AND HOW IT IS 'PLOWED TO MAPPING AND LOCALIZING DISEASE CAUSING MEANS. FINALLY I HOPE YOU'VE GOTTEN SOME SENSE OF SOMETHING THAT NOT EVERYONE APPRECIATES, POPULATION GENETICS CAN BE FUN. IT CAN TELL US INTERESTING THINGS, FUN THINGS ABOUT OURSELVES, OUR POPULATIONS, AND EVEN ABOUT OUR PHENOTYPES IN GENERAL. FINALLY I WANT TO ACKNOWLEDGE A NUMBER OF MY COLLEAGUES AT THE UNIVERSITY OF UTAH. PEOPLE IN MY LAB AND OTHERS WHO HAVE CONTRIBUTED TO THE WORK I'VE TOLD YOU ABOUT. MOBILE ELEMENT WORK THAT I TOUCHED ON JUST A LITTLE BIT I'VE DONE IN COLLABORATION OVER MANY YEARS NOW WITH MY COLLEAGUE AT LS, MARK BATZER. SOME WERE GATHERED BY THE MOLECULAR GYNECOLOGY FOUNDATION. I WANT TO THANK ALL OF THESE PEOPLE FOR THEIR CONTRIBUTIONS TO OUR RESEARCH I'D LIKE TO THANK ALL OF YOU FOR YOUR ATTENTION. [APPLAUSE] I THINK WE HAVE A COUPLE OF MINUTES FOR QUESTIONS. I'M JUST TOLD THAT USUALLY WE DON'T HAVE QUESTIONS. >> I'LL ASK ONE. THIS FIELD OF ANALYZING LINKAGE DISEQUILIBRIUM, DOES IT WORK FOR VERY, VERY RARE MUTATIONS? I WOULD CONCEIVE THAT IT WOULD HAVE TO BE PRETTY COMMON TO WORK. >> IT COULD WORK FOR A RARE MUTATION THE PROBLEM IS THAT YOU NEED FAIRLY LARGE SAMPLE SIZE AFFECTED INDIVIDUALS WHO ARE UNRELATED, AT LEAST NOT CLOSELY RELATED. IF YOU GOT REALLY RARE MUTATION IT MIGHT BE VERY DIFFICULT TO GET LARGE ENOUGH SAMPLE SAY 50 TO 100 CASES, THAT IS WHERE IT WOULD BE A CHALLENGE. >> ALSO A LOT OF DISEASE CLASSES THAT ARE CAUSED BY MULTIPLE DIFFERENT TYPES OF MUTATIONS LIKE CARDIOMYOPATHYS FOR ONE, IT WOULDN'T BE DIFFICULT TO USE LINKAGE DISEQUILIBRIUM IN SUCH A CASE? >> YEAH. IF THERE IS SUBSTANTIAL ALLELIC HETEROGENEITY. LOOK AT BRCA 1 WHERE THERE ARE HUNDREDS OF DIFFERENT MUTATIONS, EACH OF THOSE OCCURS ON A DIFFERENT HAPLOTYPE BACKGROUND YOU'RE NOT GOING TO SEE A CONSISTENT PATTERN ASSOCIATION. IF THERE'S STRONG ALLELIC HETEROGENEITY LINKAGE DISEQUILIBRIUM BECOMES -- IT ISN'T ALWAYS VERY USEFUL. >> THANKS. >> HAS ANYONE APPLIED PHYLOGENETICS TO SNPS? >> YES. >> ARE THE RESULTS MEANINGFUL OR -- >> THE RESULTS THAT WE SEE, ARE YOU TALKING ABOUT PHYLOGENETICS ACROSS SPECIES? OR WITHIN SPECIES? >> WITHIN SPECIES. >> WE LOOK AT PATTERNS IN HUMANS, THEY'RE VERY CONSISTENT WITH WHAT WE'VE SEEN WITH OTHER MARKERS, WHAT WE'VE SEEN IN WHOLE SEQUENCE, EVERY KIND OF POLYMORPHISM WE LOOK AT AUTOSOMAL POLYMORPHISM GIVES US A PATTERN PHYLOGENETICALLY.
B1 population linkage genome variation ancestry african Introduction to Population Genetics (2010) 77 10 Morris Du posted on 2015/06/29 More Share Save Report Video vocabulary