Publications
Department of Medicine faculty members published more than 3,000 peer-reviewed articles in 2022.
2013
2013
The Caribbean basin is home to some of the most complex interactions in recent history among previously diverged human populations. Here, we investigate the population genetic history of this region by characterizing patterns of genome-wide variation among 330 individuals from three of the Greater Antilles (Cuba, Puerto Rico, Hispaniola), two mainland (Honduras, Colombia), and three Native South American (Yukpa, Bari, and Warao) populations. We combine these data with a unique database of genomic variation in over 3,000 individuals from diverse European, African, and Native American populations. We use local ancestry inference and tract length distributions to test different demographic scenarios for the pre- and post-colonial history of the region. We develop a novel ancestry-specific PCA (ASPCA) method to reconstruct the sub-continental origin of Native American, European, and African haplotypes from admixed genomes. We find that the most likely source of the indigenous ancestry in Caribbean islanders is a Native South American component shared among inland Amazonian tribes, Central America, and the Yucatan peninsula, suggesting extensive gene flow across the Caribbean in pre-Columbian times. We find evidence of two pulses of African migration. The first pulse--which today is reflected by shorter, older ancestry tracts--consists of a genetic component more similar to coastal West African regions involved in early stages of the trans-Atlantic slave trade. The second pulse--reflected by longer, younger tracts--is more similar to present-day West-Central African populations, supporting historical records of later transatlantic deportation. Surprisingly, we also identify a Latino-specific European component that has significantly diverged from its parental Iberian source populations, presumably as a result of small European founder population size. We demonstrate that the ancestral components in admixed genomes can be traced back to distinct sub-continental source populations with far greater resolution than previously thought, even when limited pre-Columbian Caribbean haplotypes have survived.
View on PubMed2013
Haploinsufficiency of the hematopoietic transcription factor GATA2 underlies monocytopenia and mycobacterial infections; dendritic cell, monocyte, B, and natural killer (NK) lymphoid deficiency; familial myelodysplastic syndromes (MDS)/acute myeloid leukemia (AML); and Emberger syndrome (primary lymphedema with MDS). A comprehensive examination of the clinical features of GATA2 deficiency is currently lacking. We reviewed the medical records of 57 patients with GATA2 deficiency evaluated at the National Institutes of Health from January 1, 1992, to March 1, 2013, and categorized mutations as missense, null, or regulatory to identify genotype-phenotype associations. We identified a broad spectrum of disease: hematologic (MDS 84%, AML 14%, chronic myelomonocytic leukemia 8%), infectious (severe viral 70%, disseminated mycobacterial 53%, and invasive fungal infections 16%), pulmonary (diffusion 79% and ventilatory defects 63%, pulmonary alveolar proteinosis 18%, pulmonary arterial hypertension 9%), dermatologic (warts 53%, panniculitis 30%), neoplastic (human papillomavirus+ tumors 35%, Epstein-Barr virus+ tumors 4%), vascular/lymphatic (venous thrombosis 25%, lymphedema 11%), sensorineural hearing loss 76%, miscarriage 33%, and hypothyroidism 14%. Viral infections and lymphedema were more common in individuals with null mutations (P = .038 and P = .006, respectively). Monocytopenia, B, NK, and CD4 lymphocytopenia correlated with the presence of disease (P < .001). GATA2 deficiency unites susceptibility to MDS/AML, immunodeficiency, pulmonary disease, and vascular/lymphatic dysfunction. Early genetic diagnosis is critical to direct clinical management, preventive care, and family screening.
View on PubMed2013
2013
2013
2013
2013
2013
To date, the scientific process for generating, interpreting, and applying knowledge has received less informatics attention than operational processes for conducting clinical studies. The activities of these scientific processes - the science of clinical research - are centered on the study protocol, which is the abstract representation of the scientific design of a clinical study. The Ontology of Clinical Research (OCRe) is an OWL 2 model of the entities and relationships of study design protocols for the purpose of computationally supporting the design and analysis of human studies. OCRe's modeling is independent of any specific study design or clinical domain. It includes a study design typology and a specialized module called ERGO Annotation for capturing the meaning of eligibility criteria. In this paper, we describe the key informatics use cases of each phase of a study's scientific lifecycle, present OCRe and the principles behind its modeling, and describe applications of OCRe and associated technologies to a range of clinical research use cases. OCRe captures the central semantics that underlies the scientific processes of clinical research and can serve as an informatics foundation for supporting the entire range of knowledge activities that constitute the science of clinical research.
View on PubMed