Bonnie L Hurwitz

Associate Professor, Biosystems Engineering
Clinical Instructor, Pharmacy Practice-Science
Associate Professor, Genetics - GIDP
Associate Professor, Statistics-GIDP
Associate Professor, BIO5 Institute
Member of the Graduate Faculty

Contact

(520) 626-9819
SHANTZ, Rm. 403
TUCSON, AZ 85721-0038
bhurwitz@arizona.edu

Biography

Dr. Bonnie Hurwitz is an Associate Professor of Biosystems Engineering at the University of Arizona and Bio5 Research Institute Fellow. She has worked as a computational biologist for two decades on interdisciplinary projects in both industry and academia. Her research on the Earth and human microbiome incorporates large-scale –omics datasets, high-throughput computing, and big data analytics to answer questions in systems biology. In particular, Dr. Hurwitz is interested in how viruses re-engineer host metabolism and the implications on host-driven processes. Dr. Hurwitz is well-cited for her work in computational biology in diverse areas, from plant genomics to viral metagenomics, with over 6500 citations.

Degrees

Ph.D. Ecology and Evolutionary Biology

University of Arizona, Tucson, Arizona, United States
Viral community dynamics and functional specialization in the Pacific Ocean

B.S. Biochemistry and Molecular Biology

University of California, Santa Cruz, Santa Cruz, California, United States

Work Experience

Amazon Web Services (2022 - Ongoing)
University of Arizona, Tucson, Arizona (2020 - Ongoing)
University of Arizona, Tucson, Arizona (2014 - 2020)
University of Arizona, Tucson, Arizona (2012 - 2014)
Cold Spring Harbor Laboratory (2004 - 2008)
Third Wave Technologies (2002 - 2004)
Accelrys (2001 - 2002)
Incyte Genomics (1997 - 2001)

Awards

Amazon Scholar

Amazon Web Services, Spring 2023
Amazon Web Services, Fall 2021

Highly accessed publication in FEMS Microbiology Reviews

FEMS Microbiology Reviews, Summer 2016

Highly Accessed Publication in BMC Medicine

BMC Medicine, Spring 2014

Licensure & Certification

Bioinformatics, University of California Santa Cruz (1999)

Interests

Teaching

metagenomics, genomics, microbes, viruses, bioinformatics, computing, high performance computing

Research

metagenomics, genomics, microbes, viruses, bioinformatics, computing, big data, high-performance computing, cyberinfrastructure, cloud computing

Courses

2025-26 Courses

Intro to BAT using CURE

BAT 102 (Fall 2025)
Metagenomics

BE 487 (Fall 2025)
Metagenomics

BE 587 (Fall 2025)

2024-25 Courses

Applied Cyberinfrastruct Conc

BE 429 (Spring 2025)
Applied Cyberinfrastruct Conc

BE 529 (Spring 2025)
Applied Cyberinfrastruct Conc

INFO 529 (Spring 2025)
Applied Cyberinfrastruct Conc

ISTA 429 (Spring 2025)
Biosystems Analytics

BAT 434 (Spring 2025)
Biosystems Analytics

BE 434 (Spring 2025)
Biosystems Analytics

BE 534 (Spring 2025)
Dissertation

BE 920 (Spring 2025)
Dissertation

GENE 920 (Spring 2025)
Independent Study

BE 599 (Spring 2025)
Thesis

MCB 910 (Spring 2025)
Dissertation

BE 920 (Fall 2024)
Internship

BE 493 (Fall 2024)
Intro to BAT using CURE

BAT 102 (Fall 2024)
Metagenomics

BE 487 (Fall 2024)
Metagenomics

BE 587 (Fall 2024)
Thesis

BE 910 (Fall 2024)

2023-24 Courses

Biosystems Analytics

BAT 434 (Spring 2024)
Biosystems Analytics

BE 434 (Spring 2024)
Biosystems Analytics

BE 534 (Spring 2024)
Curr Top Plant Sci-Adv

PLS 595B (Spring 2024)
Dissertation

BE 920 (Spring 2024)
Dissertation

MCB 920 (Spring 2024)
Independent Study

BE 599 (Spring 2024)
Internship

BE 493 (Spring 2024)
Internship

BE 693 (Spring 2024)
Thesis

BE 910 (Spring 2024)
Dissertation

BE 920 (Fall 2023)
Metagenomics

BE 487 (Fall 2023)
Metagenomics

BE 587 (Fall 2023)
Thesis

BE 910 (Fall 2023)
Thesis

MCB 910 (Fall 2023)

2022-23 Courses

Biosystems Analytics

BAT 434 (Spring 2023)
Biosystems Analytics

BE 434 (Spring 2023)
Biosystems Analytics

BE 534 (Spring 2023)
Dissertation

BE 920 (Spring 2023)
Dissertation

MCB 920 (Spring 2023)
Internship

BE 493 (Spring 2023)
Lab Presentations & Discussion

MCB 696A (Spring 2023)
Dissertation

BE 920 (Fall 2022)
Dissertation

MCB 920 (Fall 2022)
Intro to BAT using CURE

BAT 102 (Fall 2022)
Lab Presentations & Discussion

MCB 696A (Fall 2022)
Metagenomics

BE 487 (Fall 2022)
Metagenomics

BE 587 (Fall 2022)

2021-22 Courses

Dissertation

BE 920 (Spring 2022)
Dissertation

MCB 920 (Spring 2022)
Lab Presentations & Discussion

MCB 696A (Spring 2022)
Dissertation

BE 920 (Fall 2021)
Dissertation

MCB 920 (Fall 2021)
Lab Presentations & Discussion

MCB 696A (Fall 2021)

2020-21 Courses

Dissertation

BE 920 (Spring 2021)
Dissertation

MCB 920 (Spring 2021)
Lab Presentations & Discussion

MCB 696A (Spring 2021)
Dissertation

BE 920 (Fall 2020)
Dissertation

MCB 920 (Fall 2020)
Independent Study

BE 199 (Fall 2020)
Lab Presentations & Discussion

MCB 696A (Fall 2020)
Metagenomics

BE 487 (Fall 2020)
Metagenomics

BE 587 (Fall 2020)

2019-20 Courses

Biosystems Analytics

BE 434 (Spring 2020)
Biosystems Analytics

BE 534 (Spring 2020)
Dissertation

BE 920 (Spring 2020)
Dissertation

MCB 920 (Spring 2020)
Honors Independent Study

BE 299H (Spring 2020)
Internship

BE 693 (Spring 2020)
Lab Presentations & Discussion

MCB 696A (Spring 2020)
Dissertation

BE 920 (Fall 2019)
Dissertation

MCB 920 (Fall 2019)
Independent Study

BE 299 (Fall 2019)
Lab Presentations & Discussion

MCB 696A (Fall 2019)
Metagenomics

BE 587 (Fall 2019)

2018-19 Courses

Biosystems Analytics

BE 434 (Spring 2019)
Biosystems Analytics

BE 534 (Spring 2019)
Directed Research

BE 492 (Spring 2019)
Dissertation

BE 920 (Spring 2019)
Lab Presentations & Discussion

MCB 696A (Spring 2019)
Research

MCB 900 (Spring 2019)
Thesis

BE 910 (Spring 2019)
Dissertation

ABE 920 (Fall 2018)
Honors Thesis

MIC 498H (Fall 2018)
Internship

ABE 493 (Fall 2018)
Internship

ABE 693 (Fall 2018)
Lab Presentations & Discussion

MCB 696A (Fall 2018)
Metagenomics

ABE 487 (Fall 2018)
Metagenomics

ABE 587 (Fall 2018)
Research

MCB 900 (Fall 2018)
Thesis

ABE 910 (Fall 2018)

2017-18 Courses

Dissertation

ABE 920 (Spring 2018)
Introduction to Research

MCB 795A (Spring 2018)
Dissertation

ABE 920 (Fall 2017)
Introduction to Research

MCB 795A (Fall 2017)
Metagenomics

ABE 487 (Fall 2017)
Metagenomics

ABE 587 (Fall 2017)
Thesis

ABE 910 (Fall 2017)

2016-17 Courses

Dissertation

ABE 920 (Spring 2017)
Dissertation

MCB 920 (Spring 2017)
Internship

ABE 393 (Spring 2017)
Internship

ABE 693 (Spring 2017)
Lab Presentations & Discussion

MCB 696A (Spring 2017)
Thesis

ABE 910 (Spring 2017)
Dissertation

ABE 920 (Fall 2016)
Dissertation

MCB 920 (Fall 2016)
Independent Study

ABE 599 (Fall 2016)
Internship

ABE 393 (Fall 2016)
Internship

ABE 593 (Fall 2016)
Internship

ABE 693 (Fall 2016)
Metagenomics

ABE 487 (Fall 2016)
Metagenomics

ABE 587 (Fall 2016)
Thesis

ABE 910 (Fall 2016)

2015-16 Courses

Internship

MCB 693 (Summer I 2016)
Dissertation

ABE 920 (Spring 2016)
Dissertation

MCB 920 (Spring 2016)
Lab Presentations & Discussion

MCB 696A (Spring 2016)
Thesis

STAT 910 (Spring 2016)

Scholarly Contributions

Chapters

Spichler-Moffrah, A., Al Mohajer, M., Hurwitz, B. L., & Armstrong, D. G. (2016). Skin and Soft Tissue Infection. In ASM: Diagnostic Microbiology of the Immunocompromised Host, Hayden RT, Carroll KC, Tang TW, Wolk DM (eds). Washington, DC: American Society of Microbiology Press.

Journals/Publications

Blumberg, K., Miller, M., Ponsero, A., & Hurwitz, B. (2022). Ontology-driven analysis of marine metagenomics: what more can we learn from our data?. GigaScience, 12.
More info
The proliferation of metagenomic sequencing technologies has enabled novel insights into the functional genomic potentials and taxonomic structure of microbial communities. However, cyberinfrastructure efforts to manage and enable the reproducible analysis of sequence data have not kept pace. Thus, there is increasing recognition of the need to make metagenomic data discoverable within machine-searchable frameworks compliant with the FAIR (Findability, Accessibility, Interoperability, and Reusability) principles for data stewardship. Although a variety of metagenomic web services exist, none currently leverage the hierarchically structured terminology encoded within common life science ontologies to programmatically discover data.
Ponsero, A. J., Miller, M., & Hurwitz, B. L. (2023). Comparison of k-mer-based comparative metagenomic tools and approaches. Microbiome research reports, 2(4), 27.
More info
Comparative metagenomic analysis requires measuring a pairwise similarity between metagenomes in the dataset. Reference-based methods that compute a beta-diversity distance between two metagenomes are highly dependent on the quality and completeness of the reference database, and their application on less studied microbiota can be challenging. On the other hand, comparative metagenomic methods only rely on the sequence composition of metagenomes to compare datasets. While each one of these approaches has its strengths and limitations, their comparison is currently limited. We developed sets of simulated short-reads metagenomes to (1) compare k-mer-based and taxonomy-based distances and evaluate the impact of technical and biological variables on these metrics and (2) evaluate the effect of k-mer sketching and filtering. We used a real-world metagenomic dataset to provide an overview of the currently available tools for metagenomic comparative analysis. Using simulated metagenomes of known composition and controlled error rate, we showed that k-mer-based distance metrics were well correlated to the taxonomic distance metric for quantitative Beta-diversity metrics, but the correlation was low for presence/absence distances. The community complexity in terms of taxa richness and the sequencing depth significantly affected the quality of the k-mer-based distances, while the impact of low amounts of sequence contamination and sequencing error was limited. Finally, we benchmarked currently available comparative metagenomic tools and compared their output on two datasets of fecal metagenomes and showed that most k-mer-based tools were able to recapitulate the data structure observed using taxonomic approaches. This study expands our understanding of the strength and limitations of k-mer-based comparative metagenomic approaches and aims to provide concrete guidelines for researchers interested in applying these approaches to their metagenomic datasets.
Schackart, K. E., Graham, J. B., Ponsero, A. J., & Hurwitz, B. L. (2023). Evaluation of computational phage detection tools for metagenomic datasets. Frontiers in microbiology, 14, 1078760.
More info
As new computational tools for detecting phage in metagenomes are being rapidly developed, a critical need has emerged to develop systematic benchmarks.
Liu, C., Ram, S., & Hurwitz, B. L. (2022). Network analysis reveals dysregulated functional patterns in type II diabetic skin. Scientific reports, 12(1), 6889.
More info
Skin disorders are one of the most common complications of type II diabetes (T2DM). Long-term effects of high blood glucose leave individuals with T2DM more susceptible to cutaneous diseases, but its underlying molecular mechanisms are unclear. Network-based methods consider the complex interactions between genes which can complement the analysis of single genes in previous research. Here, we use network analysis and topological properties to systematically investigate dysregulated gene co-expression patterns in type II diabetic skin with skin samples from the Genotype-Tissue Expression database. Our final network consisted of 8812 genes from 73 subjects with T2DM and 147 non-T2DM subjects matched for age, sex, and race. Two gene modules significantly related to T2DM were functionally enriched in the pathway lipid metabolism, activated by PPARA and SREBF (SREBP). Transcription factors KLF10, KLF4, SP1, and microRNA-21 were predicted to be important regulators of gene expression in these modules. Intramodular analysis and betweenness centrality identified NCOA6 as the hub gene while KHSRP and SIN3B are key coordinators that influence molecular activities differently between T2DM and non-T2DM populations. We built a TF-miRNA-mRNA regulatory network to reveal the novel mechanism (miR-21-PPARA-NCOA6) of dysregulated keratinocyte proliferation, differentiation, and migration in diabetic skin, which may provide new insights into the susceptibility of skin disorders in T2DM patients. Hub genes and key coordinators may serve as therapeutic targets to improve diabetic skincare.
Blumberg, K. L., Ponsero, A. J., Bomhoff, M., Wood-Charlson, E. M., DeLong, E. F., & Hurwitz, B. L. (2021). Ontology-Enriched Specifications Enabling Findable, Accessible, Interoperable, and Reusable Marine Metagenomic Datasets in Cyberinfrastructure Systems. Frontiers in microbiology, 12, 765268.
More info
Marine microbial ecology requires the systematic comparison of biogeochemical and sequence data to analyze environmental influences on the distribution and variability of microbial communities. With ever-increasing quantities of metagenomic data, there is a growing need to make datasets Findable, Accessible, Interoperable, and Reusable (FAIR) across diverse ecosystems. FAIR data is essential to developing analytical frameworks that integrate microbiological, genomic, ecological, oceanographic, and computational methods. Although community standards defining the minimal metadata required to accompany sequence data exist, they haven't been consistently used across projects, precluding interoperability. Moreover, these data are not machine-actionable or discoverable by cyberinfrastructure systems. By making 'omic and physicochemical datasets FAIR to machine systems, we can enable sequence data discovery and reuse based on machine-readable descriptions of environments or physicochemical gradients. In this work, we developed a novel technical specification for dataset encapsulation for the FAIR reuse of marine metagenomic and physicochemical datasets within cyberinfrastructure systems. This includes using Frictionless Data Packages enriched with terminology from environmental and life-science ontologies to annotate measured variables, their units, and the measurement devices used. This approach was implemented in Planet Microbe, a cyberinfrastructure platform and marine metagenomic web-portal. Here, we discuss the data properties built into the specification to make global ocean datasets FAIR within the Planet Microbe portal. We additionally discuss the selection of, and contributions to marine-science ontologies used within the specification. Finally, we use the system to discover data by which to answer various biological questions about environments, physicochemical gradients, and microbial communities in meta-analyses. This work represents a future direction in marine metagenomic research by proposing a specification for FAIR dataset encapsulation that, if adopted within cyberinfrastructure systems, would automate the discovery, exchange, and re-use of data needed to answer broader reaching questions than originally intended.
Lopez-Pier, M. A., Koppinger, M. P., Harris, P. R., Cannon, D. K., Skaria, R. S., Hurwitz, B. L., Watts, G., Aras, S., Slepian, M. J., & Konhilas, J. P. (2021). An adaptable and non-invasive method for tracking Bifidobacterium animalis subspecies lactis 420 in the mouse gut. Journal of microbiological methods, 189, 106302.
More info
Probiotic strains from the Bifidobacterium or Lactobacillus genera improve health outcomes in models of metabolic and cardiovascular disease. Yet, underlying mechanisms governing these improved health outcomes are rooted in the interaction of gut microbiota, intestinal interface, and probiotic strain. Central to defining the underlying mechanisms governing these improved health outcomes is the development of adaptable and non-invasive tools to study probiotic localization and colonization within the host gut microbiome. The objective of this study was to test labeling and tracking efficacy of Bifidobacterium animalis subspecies lactis 420 (B420) using a common clinical imaging agent, indocyanine green (ICG). ICG was an effective in situ labeling agent visualized in either intact mouse or excised gastrointestinal (GI) tract at different time intervals. Quantitative PCR was used to validate ICG visualization of B420, which also demonstrated that B420 transit time matched normal murine GI motility (~8 hours). Contrary to previous thoughts, B420 did not colonize any region of the GI tract whether following a single bolus or daily administration for up to 10 days. We conclude that ICG may provide a useful tool to visualize and track probiotic species such as B420 without implementing complex molecular and genetic tools. Proof-of-concept studies indicate that B420 did not colonize and establish residency align the murine GI tract.
Ponsero, A. J., Hurwitz, B. L., Magain, N., Miadlikowska, J., Lutzoni, F., & U'Ren, J. M. (2021). Cyanolichen microbiome contains novel viruses that encode genes to promote microbial metabolism. ISME communications, 1(1), 56.
More info
Lichen thalli are formed through the symbiotic association of a filamentous fungus and photosynthetic green alga and/or cyanobacterium. Recent studies have revealed lichens also host highly diverse communities of secondary fungal and bacterial symbionts, yet few studies have examined the viral component within these complex symbioses. Here, we describe viral biodiversity and functions in cyanolichens collected from across North America and Europe. As current machine-learning viral-detection tools are not trained on complex eukaryotic metagenomes, we first developed efficient methods to remove eukaryotic reads prior to viral detection and a custom pipeline to validate viral contigs predicted with three machine-learning methods. Our resulting high-quality viral data illustrate that every cyanolichen thallus contains diverse viruses that are distinct from viruses in other terrestrial ecosystems. In addition to cyanobacteria, predicted viral hosts include other lichen-associated bacterial lineages and algae, although a large fraction of viral contigs had no host prediction. Functional annotation of cyanolichen viral sequences predicts numerous viral-encoded auxiliary metabolic genes (AMGs) involved in amino acid, nucleotide, and carbohydrate metabolism, including AMGs for secondary metabolism (antibiotics and antimicrobials) and fatty acid biosynthesis. Overall, the diversity of cyanolichen AMGs suggests that viruses may alter microbial interactions within these complex symbiotic assemblages.
Steiner, H. E., Gee, K., Giles, J., Knight, H., Hurwitz, B. L., & Karnes, J. H. (2021). Role of the gut microbiome in cardiovascular drug response: The potential for clinical application. Pharmacotherapy.
More info
Response to cardiovascular drugs can vary greatly between individuals, and the role of the microbiome in this variability is being increasingly appreciated. Recent evidence indicates that bacteria and other microbes are responsible for direct and indirect effects on drug efficacy and toxicity. Pharmacomicrobiomics aims to uncover variability in drug response due to microbes in the human body, which may alter drug disposition through microbial metabolism, interference by microbial metabolites, or modification of host enzymes. In this review, we present recent advances in our understanding of the interplay between microbes, host metabolism, and cardiovascular drugs. We report numerous cardiovascular drugs with evidence of, or potential for, gut-microbe interactions. However, the effects of gut microbiota on many cardiovascular drugs are yet uninvestigated. Finally, we consider potential clinical applications for the described findings.
Vangay, P., Burgin, J., Johnston, A., Beck, K. L., Berrios, D. C., Blumberg, K., Canon, S., Chain, P., Chandonia, J. M., Christianson, D., Costes, S. V., Damerow, J., Duncan, W. D., Dundore-Arias, J. P., Fagnan, K., Galazka, J. M., Gibbons, S. M., Hays, D., Hervey, J., , Hu, B., et al. (2021). Correction for Vangay et al., "Microbiome Metadata Standards: Report of the National Microbiome Data Collaborative's Workshop and Follow-On Activities". mSystems, 6(3).
Vangay, P., Burgin, J., Johnston, A., Beck, K. L., Berrios, D. C., Blumberg, K., Canon, S., Chain, P., Chandonia, J. M., Christianson, D., Costes, S. V., Damerow, J., Duncan, W. D., Dundore-Arias, J. P., Fagnan, K., Galazka, J. M., Gibbons, S. M., Hays, D., Hervey, J., , Hu, B., et al. (2021). Microbiome Metadata Standards: Report of the National Microbiome Data Collaborative's Workshop and Follow-On Activities. mSystems, 6(1).
More info
Microbiome samples are inherently defined by the environment in which they are found. Therefore, data that provide context and enable interpretation of measurements produced from biological samples, often referred to as metadata, are critical. Important contributions have been made in the development of community-driven metadata standards; however, these standards have not been uniformly embraced by the microbiome research community. To understand how these standards are being adopted, or the barriers to adoption, across research domains, institutions, and funding agencies, the National Microbiome Data Collaborative (NMDC) hosted a workshop in October 2019. This report provides a summary of discussions that took place throughout the workshop, as well as outcomes of the working groups initiated at the workshop.
Hurwitz, B. L. (2020). Future Directions of the Cyberinfrastructure for Sustained Scientific Innovation (CSSI) Program. CSSI 2019 Workshop.
More info
The CSSI 2019 workshop was held on October 28-29, 2019, in Austin, Texas. The main objectives of this workshop were to (1) understand the impact of the CSSI program on the community over the last 9 years, (2) engage workshop participants in identifying gaps and opportunities in the current CSSI landscape, (3) gather ideas on the cyberinfrastructure needs and expectations of the community with respect to the CSSI program, and (4) prepare a report summarizing the feedback gathered from the community that can inform the future solicitations of the CSSI program. The workshop brought together different stakeholders interested in provisioning sustainable cyberinfrastructure that can power discoveries impacting the various fields of science and technology and maintaining the nation's competitiveness in the areas such as scientific software, HPC, networking, cybersecurity, and data/information science. The workshop served as a venue for gathering the community-feedback on the current state of the CSSI program and its future directions. [Journal_ref: ]
Hurwitz, B. L., & Watts, G. S. (2020). Metagenomic Next-Generation Sequencing in Clinical Microbiology. Clinical Microbiology Newsletter, 42(7). doi:https://doi.org/10.1016/j.clinmicnews.2020.03.004
More info
Invited Editorial
Liu, C., Ponsero, A. J., Armstrong, D. G., Lipsky, B. A., & Hurwitz, B. L. (2020). The dynamic wound microbiome. BMC medicine, 18(1), 358.
More info
Diabetic foot ulcers (DFUs) account for the majority of all limb amputations and hospitalizations due to diabetes complications. With 30 million cases of diabetes in the USA and 500,000 new diagnoses each year, DFUs are a growing health problem. Diabetes patients with limb amputations have high postoperative mortality, a high rate of secondary amputation, prolonged inpatient hospital stays, and a high incidence of re-hospitalization. DFU-associated amputations constitute a significant burden on healthcare resources that cost more than 10 billion dollars per year. Currently, there is no way to identify wounds that will heal versus those that will become severely infected and require amputation.
Pace, J., Youens-Clark, K., Freeman, C., Hurwitz, B., & Van Doorslaer, K. (2020). PuMA: A papillomavirus genome annotation tool. Virus evolution, 6(2), veaa068.
More info
High-throughput sequencing technologies provide unprecedented power to identify novel viruses from a wide variety of (environmental) samples. The field of 'viral metagenomics' has dramatically expanded our understanding of viral diversity. Viral metagenomic approaches imply that many novel viruses will not be described by researchers who are experts on (the genomic organization of) that virus family. We have developed the papillomavirus annotation tool (PuMA) to provide researchers with a convenient and reproducible method to annotate and report novel papillomaviruses. PuMA currently correctly annotates 99% of the papillomavirus genes when benchmarked against the 655 reference genomes in the papillomavirus episteme. Compared to another viral annotation pipeline, PuMA annotates more viral features while being more accurate. To demonstrate its general applicability, we also developed a preliminary version of PuMA that can annotate polyomaviruses. PuMA is available on GitHub (https://github.com/KVD-lab/puma) and through the iMicrobe online environment (https://www.imicrobe.us/#/apps/puma).
Ponsero, A. J., Bomhoff, M., Blumberg, K., Youens-Clark, K., Herz, N. M., Wood-Charlson, E. M., Delong, E. F., & Hurwitz, B. L. (2021). Planet Microbe: a platform for marine microbiology to discover and analyze interconnected 'omics and environmental data. Nucleic acids research, 49(D1), D792-D802.
More info
In recent years, large-scale oceanic sequencing efforts have provided a deeper understanding of marine microbial communities and their dynamics. These research endeavors require the acquisition of complex and varied datasets through large, interdisciplinary and collaborative efforts. However, no unifying framework currently exists for the marine science community to integrate sequencing data with physical, geological, and geochemical datasets. Planet Microbe is a web-based platform that enables data discovery from curated historical and on-going oceanographic sequencing efforts. In Planet Microbe, each 'omics sample is linked with other biological and physiochemical measurements collected for the same water samples or during the same sample collection event, to provide a broader environmental context. This work highlights the need for curated aggregation efforts that can enable new insights into high-quality metagenomic datasets. Planet Microbe is freely accessible from https://www.planetmicrobe.org/.
Schriml, L. M., Chuvochina, M., Davies, N., Eloe-Fadrosh, E. A., Finn, R. D., Hugenholtz, P., Hunter, C. I., Hurwitz, B. L., Kyrpides, N. C., Meyer, F., Mizrachi, I. K., Sansone, S. A., Sutton, G., Tighe, S., & Walls, R. (2020). COVID-19 pandemic reveals the peril of ignoring metadata standards. Scientific data, 7(1), 188.
Watts, G. S., & Hurwitz, B. L. (2020). Metagenomic Next-Generation Sequencing in Clinical Microbiology. Clinical Microbiology Newsletter, 42(7), 53 - 59.
Daniel, S. G., Ball, C. L., Besselsen, D. G., Doetschman, T., & Hurwitz, B. L. (2019). Functional Changes in the Gut Microbiome Contribute to Transforming Growth Factor β-Deficient Colon Cancer. mSystems, 2(5).
More info
Colorectal cancer (CRC) is one of the most treatable cancers, with a 5-year survival rate of ~64%, yet over 50,000 deaths occur yearly in the United States. In 15% of cases, deficiency in mismatch repair leads to null mutations in transforming growth factor β (TGF-β) type II receptor, yet genotype alone is not responsible for tumorigenesis. Previous work in mice shows that disruptions in TGF-β signaling combined with cause tumorigenesis, indicating a synergistic effect between genotype and microbial environment. Here, we examine functional shifts in the gut microbiome in CRC using integrated -omics approaches to untangle the role of host genotype, inflammation, and microbial ecology. We profile the gut microbiome of 40 mice with/without deficiency in TGF-β signaling from a (mothers against decapentaplegic homolog-3) knockout and with/without inoculation with . Clear functional differences in the microbiome tied to specific bacterial species emerge from four pathways related to human colon cancer: lipopolysaccharide (LPS) production, polyamine synthesis, butyrate metabolism, and oxidative phosphorylation (OXPHOS). Specifically, an increase in drives LPS production, which is associated with an inflammatory response. We observe a commensurate decrease in butyrate production from bacterium A4, which could promote tumor formation. causes an increase in OXPHOS that may increase DNA-damaging free radicals. Finally, multiple bacterial species increase polyamines that are associated with colon cancer, implicating not just diet but also the microbiome in polyamine levels. These insights into cross talk between the microbiome, host genotype, and inflammation could promote the development of diagnostics and therapies for CRC. Most research on the gut microbiome in colon cancer focuses on taxonomic changes at the genus level using 16S rRNA gene sequencing. Here, we develop a new methodology to integrate DNA and RNA data sets to examine functional shifts at the species level that are important to tumor development. We uncover several metabolic pathways in the microbiome that, when perturbed by host genetics and inoculation, contribute to colon cancer. The work presented here lays a foundation for improved bioinformatics methodologies to closely examine the cross talk between specific organisms and the host, important for the development of diagnostics and pre/probiotic treatment.
Ponsero, A. J., & Hurwitz, B. L. (2019). The Promises and Pitfalls of Machine Learning for Detecting Viruses in Aquatic Metagenomes. Frontiers in microbiology, 10, 806.
More info
Tools allowing for the identification of viral sequences in host-associated and environmental metagenomes allows for a better understanding of the genetics and ecology of viruses and their hosts. Recently, new approaches using machine learning methods to distinguish viral from bacterial signal using k-mer sequence signatures were published for identifying viral contigs in metagenomes. The promise of these content-based approaches is the ability to discover new viruses, with no or few known relatives. In this perspective paper, we examine the use of the content-based machine learning tool VirFinder for the identification of viral sequences in aquatic metagenomes and explore the possibility of using ecosystem-focused models targeted to marine metagenomes. We discuss the impact of the training set composition on the tool performance and the current limitation for the retrieval of low abundance viral sequences in metagenomes. We identify potential biases that could arise from machine learning approaches for viral hunting in real-world datasets and suggest possible avenues to overcome them.
Watts, G. S., Thornton, J. E., Youens-Clark, K., Ponsero, A. J., Slepian, M. J., Menashi, E., Hu, C., Deng, W., Armstrong, D. G., Reed, S., Cranmer, L. D., & Hurwitz, B. L. (2019). Identification and quantitation of clinically relevant microbes in patient samples: Comparison of three k-mer based classifiers for speed, accuracy, and sensitivity. PLoS computational biology, 15(11), e1006863.
More info
Infections are a serious health concern worldwide, particularly in vulnerable populations such as the immunocompromised, elderly, and young. Advances in metagenomic sequencing availability, speed, and decreased cost offer the opportunity to supplement or even replace culture-based identification of pathogens with DNA sequence-based diagnostics. Adopting metagenomic analysis for clinical use requires that all aspects of the workflow are optimized and tested, including data analysis and computational time and resources. We tested the accuracy, sensitivity, and resource requirements of three top metagenomic taxonomic classifiers that use fast k-mer based algorithms: Centrifuge, CLARK, and KrakenUniq. Binary mixtures of bacteria showed all three reliably identified organisms down to 1% relative abundance, while only the relative abundance estimates of Centrifuge and CLARK were accurate. All three classifiers identified the organisms present in their default databases from a mock bacterial community of 20 organisms, but only Centrifuge had no false positives. In addition, Centrifuge required far less computational resources and time for analysis. Centrifuge analysis of metagenomes obtained from samples of VAP, infected DFUs, and FN showed Centrifuge identified pathogenic bacteria and one virus that were corroborated by culture or a clinical PCR assay. Importantly, in both diabetic foot ulcer patients, metagenomic sequencing identified pathogens 4-6 weeks before culture. Finally, we show that Centrifuge results were minimally affected by elimination of time-consuming read quality control and host screening steps.
Youens-Clark, K., Bomhoff, M., Ponsero, A. J., Wood-Charlson, E. M., Lynch, J., Choi, I., Hartman, J. H., & Hurwitz, B. L. (2019). Erratum to: iMicrobe: Tools and data-driven discovery platform for the microbiome sciences. GigaScience, 8(8).
Youens-Clark, K., Bomhoff, M., Ponsero, A. J., Wood-Charlson, E. M., Lynch, J., Choi, I., Hartman, J. H., & Hurwitz, B. L. (2019). iMicrobe: Tools and data-dreaiven discovery platform for the microbiome sciences. GigaScience, 8(7).
More info
Scientists have amassed a wealth of microbiome datasets, making it possible to study microbes in biotic and abiotic systems on a population or planetary scale; however, this potential has not been fully realized given that the tools, datasets, and computation are available in diverse repositories and locations. To address this challenge, we developed iMicrobe.us, a community-driven microbiome data marketplace and tool exchange for users to integrate their own data and tools with those from the broader community.
Choi, I., Ponsero, A. J., Bomhoff, M., Youens-Clark, K., Hartman, J. H., & Hurwitz, B. L. (2018). Libra: scalable k-mer based tool for massive all-vs-all metagenome comparisons. GigaScience.
More info
Shotgun metagenomics provides powerful insights into microbial community biodiversity and function. Yet, inferences from metagenomic studies are often limited by dataset size and complexity and are restricted by the availability and completeness of existing databases. De novo comparative metagenomics enables the comparison of metagenomes based on their total genetic content.
Choi, I., Ponsero, A. J., Youens-Clark, K., Bomhoff, M., Hartman, J. H., & Hurwitz, B. L. (2018). Libra: scalable k-mer based tool for massive all-vs-all metagenome comparisons. GigaScience.
Hurwitz, B. L. (2018). The Relationship of Host Genetics and the Microbiome in Colon Cancer.. Journal of Animal Science, 96(suppl_2), 15-15. doi:10.1093/jas/sky073.026
Roux, S., Adriaenssens, E. M., Dutilh, B. E., Koonin, E. V., Kropinski, A. M., Krupovic, M., Kuhn, J. H., Lavigne, R., Brister, J. R., Varsani, A., Amid, C., Aziz, R. K., Bordenstein, S. R., Bork, P., Breitbart, M., Cochrane, G. R., Daly, R. A., Desnues, C., Duhaime, M. B., , Emerson, J. B., et al. (2018). Minimum Information about an Uncultivated Virus Genome (MIUViG). Nature biotechnology.
More info
We present an extension of the Minimum Information about any (x) Sequence (MIxS) standard for reporting sequences of uncultivated virus genomes. Minimum Information about an Uncultivated Virus Genome (MIUViG) standards were developed within the Genomic Standards Consortium framework and include virus origin, genome quality, genome annotation, taxonomic classification, biogeographic distribution and in silico host prediction. Community-wide adoption of MIUViG standards, which complement the Minimum Information about a Single Amplified Genome (MISAG) and Metagenome-Assembled Genome (MIMAG) standards for uncultivated bacteria and archaea, will improve the reporting of uncultivated virus genomes in public databases. In turn, this should enable more robust comparative studies and a systematic exploration of the global virosphere.
Alberti, A., Poulain, J., Engelen, S., Labadie, K., Romac, S., Ferrera, I., Albini, G., Aury, J. M., Belser, C., Bertrand, A., Cruaud, C., Da Silva, C., Dossat, C., Gavory, F., Gas, S., Guy, J., Haquelle, M., Jacoby, E., Jaillon, O., , Lemainque, A., et al. (2017). Viral to metazoan marine plankton nucleotide sequences from the Tara Oceans expedition. Scientific data, 4, 170093.
More info
A unique collection of oceanic samples was gathered by the Tara Oceans expeditions (2009-2013), targeting plankton organisms ranging from viruses to metazoans, and providing rich environmental context measurements. Thanks to recent advances in the field of genomics, extensive sequencing has been performed for a deep genomic analysis of this huge collection of samples. A strategy based on different approaches, such as metabarcoding, metagenomics, single-cell genomics and metatranscriptomics, has been chosen for analysis of size-fractionated plankton communities. Here, we provide detailed procedures applied for genomic data generation, from nucleic acids extraction to sequence production, and we describe registries of genomics datasets available at the European Nucleotide Archive (ENA, www.ebi.ac.uk/ena). The association of these metadata to the experimental procedures applied for their generation will help the scientific community to access these data and facilitate their analysis. This paper complements other efforts to provide a full description of experiments and open science resources generated from the Tara Oceans project, further extending their value for the study of the world's planktonic ecosystems.
Brum, J. R., Hurwitz, B. L., Schofield, O., Ducklow, H. W., & Sullivan, M. B. (2017). Seasonal time bombs: dominant temperate viruses affect Southern Ocean microbial dynamics. The ISME journal, 11(2), 588.
Eizenga, G. C., Sanchez, P. L., Jackson, A. K., Edwards, J. D., Hurwitz, B. L., Wing, R. A., & Kudrna, D. (2017). Genetic variation for domestication-related traits revealed in a cultivated rice, Nipponbare (Oryza sativa ssp. japonica) x ancestral rice, O-nivara, mapping population. MOLECULAR BREEDING, 37(11).
Hurwitz, B. L., Ponsero, A., Thornton, J., & U'Ren, J. M. (2017). Phage hunters: Computational strategies for finding phages in large-scale 'omics datasets. Virus research, 244, 110-115.
More info
A plethora of tools exist for identifying phage sequences in bacterial genomes, single cell amplified genomes, and host-associated and environmental metagenomes. Yet because the genetics of phages and their hosts are closely intertwined, distinguishing viral from bacterial signal remains an ongoing challenge. Further the size, quantity and fragmentary nature of modern 'omics datasets ushers in a new set of computational challenges. Here, we detail the promises and pitfalls of using currently available gene-centric or k-mer based tools for identifying prophage sequences in genomes and prophage and viral contigs in metagenomes. Each of these methods offers a unique piece of the puzzle to elucidating the intriguing signatures of phage-host coevolution.
Hurwitz, B., Cranmer, L., Slepian, M., Watts, G., Youens‐Clark, K., Wolk, D., Oshiro, M., Metzger, G., & Dhingra, D. (2017). 16S rRNA gene sequencing on a benchtop sequencer: accuracy for identification of clinically important bacteria. Journal of Applied Microbiology, 123(6), 1584-1596. doi:10.1111/jam.13590
Watts, G. S., Youens-Clark, K., Slepian, M. J., Wolk, D. M., Oshiro, M. M., Metzger, G. S., Dhingra, D., Cranmer, L. D., & Hurwitz, B. L. (2017). 16S rRNA gene sequencing on a benchtop sequencer: accuracy for identification of clinically important bacteria. Journal of applied microbiology, 123(6), 1584-1596.
More info
Test the choice of 16S rRNA gene amplicon and data analysis method on the accuracy of identification of clinically important bacteria utilizing a benchtop sequencer.
Bolduc, B., Youens-Clark, K., Roux, S., Hurwitz, B. L., & Sullivan, M. B. (2017). iVirus: facilitating new insights in viral ecology with software and community data sets imbedded in a cyberinfrastructure. The ISME journal, 11(1), 7-14.
More info
Microbes affect nutrient and energy transformations throughout the world's ecosystems, yet they do so under viral constraints. In complex communities, viral metagenome (virome) sequencing is transforming our ability to quantify viral diversity and impacts. Although some bottlenecks, for example, few reference genomes and nonquantitative viromics, have been overcome, the void of centralized data sets and specialized tools now prevents viromics from being broadly applied to answer fundamental ecological questions. Here we present iVirus, a community resource that leverages the CyVerse cyberinfrastructure to provide access to viromic tools and data sets. The iVirus Data Commons contains both raw and processed data from 1866 samples and 73 projects derived from global ocean expeditions, as well as existing and legacy public repositories. Through the CyVerse Discovery Environment, users can interrogate these data sets using existing analytical tools (software applications known as 'Apps') for assembly, open reading frame prediction and annotation, as well as several new Apps specifically developed for analyzing viromes. Because Apps are web based and powered by CyVerse supercomputing resources, they enable scalable analyses for a broad user base. Finally, a use-case scenario documents how to apply these advances toward new data. This growing iVirus resource should help researchers utilize viromics as yet another tool to elucidate viral roles in nature.
Brum, J. R., Hurwitz, B. L., Schofield, O., Ducklow, H. W., & Sullivan, M. B. (2016). Seasonal time bombs: dominant temperate viruses affect Southern Ocean microbial dynamics. The ISME journal, 10(2), 437-49.
More info
Rapid warming in the highly productive western Antarctic Peninsula (WAP) region of the Southern Ocean has affected multiple trophic levels, yet viral influences on microbial processes and ecosystem function remain understudied in the Southern Ocean. Here we use cultivation-independent quantitative ecological and metagenomic assays, combined with new comparative bioinformatic techniques, to investigate double-stranded DNA viruses during the WAP spring-summer transition. This study demonstrates that (i) temperate viruses dominate this region, switching from lysogeny to lytic replication as bacterial production increases, and (ii) Southern Ocean viral assemblages are genetically distinct from lower-latitude assemblages, primarily driven by this temperate viral dominance. This new information suggests fundamentally different virus-host interactions in polar environments, where intense seasonal changes in bacterial production select for temperate viruses because of increased fitness imparted by the ability to switch replication strategies in response to resource availability. Further, temperate viral dominance may provide mechanisms (for example, bacterial mortality resulting from prophage induction) that help explain observed temporal delays between, and lower ratios of, bacterial and primary production in polar versus lower-latitude marine ecosystems. Together these results suggest that temperate virus-host interactions are critical to predicting changes in microbial dynamics brought on by warming in polar marine systems.
Hurwitz, B. L., & U'Ren, J. M. (2016). Viral metabolic reprogramming in marine ecosystems. Current opinion in microbiology, 31, 161-8.
More info
Marine viruses often contain host-derived metabolic genes (i.e., auxiliary metabolic genes; AMGs), which are hypothesized to increase viral replication by augmenting key steps in host metabolism. Currently described AMGs encompass a wide variety of metabolic functions, including amino acid and carbohydrate metabolism, energy production, and iron-sulfur cluster assembly and modification, and their community-wide gene content and abundance vary as a function of environmental conditions. Here, we describe different AMGs classes, their hypothesized role in redirecting host carbon metabolism, and their ecological importance. Focusing on metagenomic ocean surveys, we propose a new model where a suite of phage-encoded genes activate host pathways that respond rapidly to environmental cues, presumably resulting in rapid changes to host metabolic flux for phage production.
Hurwitz, B. L., U'Ren, J. M., & Youens-Clark, K. (2016). Computational prospecting the great viral unknown. FEMS microbiology letters, 363(10).
More info
Bacteriophages play an important role in host-driven biological processes by controlling bacterial population size, horizontally transferring genes between hosts and expressing host-derived genes to alter host metabolism. Metagenomics provides the genetic basis for understanding the interplay between uncultured bacteria, their phage and the environment. In particular, viral metagenomes (viromes) are providing new insight into phage-encoded host genes (i.e. auxiliary metabolic genes; AMGs) that reprogram host metabolism during infection. Yet, despite deep sequencing efforts of viral communities, the majority of sequences have no match to known proteins. Reference-independent computational techniques, such as protein clustering, contig spectra and ecological profiling are overcoming these barriers to examine both the known and unknown components of viromes. As the field of viral metagenomics progresses, a critical assessment of tools is required as the majority of algorithms have been developed for analyzing bacteria. The aim of this paper is to offer an overview of current computational methodologies for virome analysis and to provide an example of reference-independent approaches using human skin viromes. Additionally, we present methods to carefully validate AMGs from host contamination. Despite computational challenges, these new methods offer novel insights into the diversity and functional roles of phages in diverse environments.
Kindler, L., Stoliartchouk, A., Teytelman, L., & Hurwitz, B. L. (2016). Method-centered digital communities on protocols.io for fast-paced scientific innovation. F1000Research, 5, 2271.
More info
The Internet has enabled online social interaction for scientists beyond physical meetings and conferences. Yet despite these innovations in communication, dissemination of methods is often relegated to just academic publishing. Further, these methods remain static, with subsequent advances published elsewhere and unlinked. For communities undergoing fast-paced innovation, researchers need new capabilities to share, obtain feedback, and publish methods at the forefront of scientific development. For example, a renaissance in virology is now underway given the new metagenomic methods to sequence viral DNA directly from an environment. Metagenomics makes it possible to "see" natural viral communities that could not be previously studied through culturing methods. Yet, the knowledge of specialized techniques for the production and analysis of viral metagenomes remains in a subset of labs. This problem is common to any community using and developing emerging technologies and techniques. We developed new capabilities to create virtual communities in protocols.io, an open access platform, for disseminating protocols and knowledge at the forefront of scientific development. To demonstrate these capabilities, we present a virology community forum called VERVENet. These new features allow virology researchers to share protocols and their annotations and optimizations, connect with the broader virtual community to share knowledge, job postings, conference announcements through a common online forum, and discover the current literature through personalized recommendations to promote discussion of cutting edge research. Virtual communities in protocols.io enhance a researcher's ability to: discuss and share protocols, connect with fellow community members, and learn about new and innovative research in the field. The web-based software for developing virtual communities is free to use on protocols.io. Data are available through public APIs at protocols.io.
Moffarah, A. S., Al Mohajer, M., Hurwitz, B. L., & Armstrong, D. G. (2016). Skin and Soft Tissue Infections. Microbiology spectrum, 4(4).
More info
The skin is colonized by a diverse collection of microorganisms which, for the most part, peacefully coexist with their hosts. Skin and soft tissue infections (SSTIs) encompass a variety of conditions; in immunocompromised hosts, SSTIs can be caused by diverse microorganisms-most commonly bacteria, but also fungi, viruses, mycobacteria, and protozoa. The diagnosis of SSTIs is difficult because they may commonly masquerade as other clinical syndromes or can be a manifestation of systemic disease. In immunocompromised hosts, SSTI poses a major diagnostic challenge, and clinical dermatological assessment should be initially performed; to better identify the pathogen and to lead to appropriate treatment, etiology should include cultures of lesions and blood, biopsy with histology, specific microbiological analysis with special stains, molecular techniques, and antigen-detection methodologies. Here, we reviewed the epidemiology, pathophysiology, clinical presentation, and diagnostic techniques, including molecular biological techniques, used for SSTIs, with a focus on the immunocompromised host, such as patients with cellular immunodeficiency, HIV, and diabetic foot infection.
Teytelman, L., Stoliartchouk, A., Kindler, L., & Hurwitz, B. L. (2016). Protocols.io: Virtual Communities for Protocol Development and Discussion. PLoS biology, 14(8), e1002538.
More info
The detailed know-how to implement research protocols frequently remains restricted to the research group that developed the method or technology. This knowledge often exists at a level that is too detailed for inclusion in the methods section of scientific articles. Consequently, methods are not easily reproduced, leading to a loss of time and effort by other researchers. The challenge is to develop a method-centered collaborative platform to connect with fellow researchers and discover state-of-the-art knowledge. Protocols.io is an open-access platform for detailing, sharing, and discussing molecular and computational protocols that can be useful before, during, and after publication of research results.
Armstrong, D. G., Hurwitz, B. L., & Lipsky, B. A. (2015). Set Phages to Stun: Reducing the Virulence of Staphylococcus aureus in Diabetic Foot Ulcers. Diabetes, 64(8), 2701-3.
Armstrong, D. G., Lew, E. J., Hurwitz, B. L., & Wild, T. (2015). The quest for tissue repair’s holy grail: the promise of wound diagnostics or just another fishing expedition?. Wound Medicine, 8, 1-5.
Hurwitz, B. L., Brum, J. R., & Sullivan, M. B. (2015). Depth-stratified functional and taxonomic niche specialization in the 'core' and 'flexible' Pacific Ocean Virome. The ISME journal, 9(2), 472-84.
More info
Microbes drive myriad ecosystem processes, and their viruses modulate microbial-driven processes through mortality, horizontal gene transfer, and metabolic reprogramming by viral-encoded auxiliary metabolic genes (AMGs). However, our knowledge of viral roles in the oceans is primarily limited to surface waters. Here we assess the depth distribution of protein clusters (PCs) in the first large-scale quantitative viral metagenomic data set that spans much of the pelagic depth continuum (the Pacific Ocean Virome; POV). This established 'core' (180 PCs; one-third new to science) and 'flexible' (423K PCs) community gene sets, including niche-defining genes in the latter (385 and 170 PCs are exclusive and core to the photic and aphotic zones, respectively). Taxonomic annotation suggested that tailed phages are ubiquitous, but not abundant (
Roux, S., Enault, F., Hurwitz, B. L., & Sullivan, M. B. (2015). VirSorter: mining viral signal from microbial genomic data. PeerJ, 3, e985.
More info
Viruses of microbes impact all ecosystems where microbes drive key energy and substrate transformations including the oceans, humans and industrial fermenters. However, despite this recognized importance, our understanding of viral diversity and impacts remains limited by too few model systems and reference genomes. One way to fill these gaps in our knowledge of viral diversity is through the detection of viral signal in microbial genomic data. While multiple approaches have been developed and applied for the detection of prophages (viral genomes integrated in a microbial genome), new types of microbial genomic data are emerging that are more fragmented and larger scale, such as Single-cell Amplified Genomes (SAGs) of uncultivated organisms or genomic fragments assembled from metagenomic sequencing. Here, we present VirSorter, a tool designed to detect viral signal in these different types of microbial sequence data in both a reference-dependent and reference-independent manner, leveraging probabilistic models and extensive virome data to maximize detection of novel viruses. Performance testing shows that VirSorter's prophage prediction capability compares to that of available prophage predictors for complete genomes, but is superior in predicting viral sequences outside of a host genome (i.e., from extrachromosomal prophages, lytic infections, or partially assembled prophages). Furthermore, VirSorter outperforms existing tools for fragmented genomic and metagenomic datasets, and can identify viral signal in assembled sequence (contigs) as short as 3kb, while providing near-perfect identification (>95% Recall and 100% Precision) on contigs of at least 10kb. Because VirSorter scales to large datasets, it can also be used in "reverse" to more confidently identify viral sequence in viral metagenomes by sorting away cellular DNA whether derived from gene transfer agents, generalized transduction or contamination. Finally, VirSorter is made available through the iPlant Cyberinfrastructure that provides a web-based user interface interconnected with the required computing resources. VirSorter thus complements existing prophage prediction softwares to better leverage fragmented, SAG and metagenomic datasets in a way that will scale to modern sequencing. Given these features, VirSorter should enable the discovery of new viruses in microbial datasets, and further our understanding of uncultivated viral communities across diverse ecosystems.
Spichler, A., Hurwitz, B. L., Armstrong, D. G., & Lipsky, B. A. (2015). Microbiology of diabetic foot infections: from Louis Pasteur to 'crime scene investigation'. BMC medicine, 13, 2.
More info
Were he alive today, would Louis Pasteur still champion culture methods he pioneered over 150 years ago for identifying bacterial pathogens? Or, might he suggest that new molecular techniques may prove a better way forward for quickly detecting the true microbial diversity of wounds? As modern clinicians faced with treating complex patients with diabetic foot infections (DFI), should we still request venerated and familiar culture and sensitivity methods, or is it time to ask for newer molecular tests, such as 16S rRNA gene sequencing? Or, are molecular techniques as yet too experimental, non-specific and expensive for current clinical use? While molecular techniques help us to identify more microorganisms from a DFI, can they tell us 'who done it?', that is, which are the causative pathogens and which are merely colonizers? Furthermore, can molecular techniques provide clinically relevant, rapid information on the virulence of wound isolates and their antibiotic sensitivities? We herein review current knowledge on the microbiology of DFI, from standard culture methods to the current era of rapid and comprehensive 'crime scene investigation' (CSI) techniques.
U'Ren, J. M., Wisecaver, J. H., Paek, A. L., Dunn, B. L., & Hurwitz, B. L. (2015). Draft Genome Sequence of the Ale-Fermenting Saccharomyces cerevisiae Strain GSY2239. Genome announcements, 3(4).
More info
Saccharomyces cerevisiae strain GSY2239 is derived from an industrial yeast strain used to ferment ale-style beer. We present here the 11.5-Mb draft genome sequence for this organism.
Hurwitz, B. L., Westveld, A. H., Brum, J. R., & Sullivan, M. B. (2014). Modeling ecological drivers in marine viral communities using comparative metagenomics and network analyses. Proceedings of the National Academy of Sciences of the United States of America, 111(29), 10714-9.
More info
Long-standing questions in marine viral ecology are centered on understanding how viral assemblages change along gradients in space and time. However, investigating these fundamental ecological questions has been challenging due to incomplete representation of naturally occurring viral diversity in single gene- or morphology-based studies and an inability to identify up to 90% of reads in viral metagenomes (viromes). Although protein clustering techniques provide a significant advance by helping organize this unknown metagenomic sequence space, they typically use only ∼75% of the data and rely on assembly methods not yet tuned for naturally occurring sequence variation. Here, we introduce an annotation- and assembly-free strategy for comparative metagenomics that combines shared k-mer and social network analyses (regression modeling). This robust statistical framework enables visualization of complex sample networks and determination of ecological factors driving community structure. Application to 32 viromes from the Pacific Ocean Virome dataset identified clusters of samples broadly delineated by photic zone and revealed that geographic region, depth, and proximity to shore were significant predictors of community structure. Within subsets of this dataset, depth, season, and oxygen concentration were significant drivers of viral community structure at a single open ocean station, whereas variability along onshore-offshore transects was driven by oxygen concentration in an area with an oxygen minimum zone and not depth or proximity to shore, as might be expected. Together these results demonstrate that this highly scalable approach using complete metagenomic network-based comparisons can both test and generate hypotheses for ecological investigation of viral and microbial communities in nature.
Rankin, T. M., Giovinco, N. A., Cucher, D. J., Watts, G., Hurwitz, B., & Armstrong, D. G. (2014). Three-dimensional printing surgical instruments: are we there yet?. The Journal of surgical research, 189(2), 193-7.
More info
The applications for rapid prototyping have expanded dramatically over the last 20 y. In recent years, additive manufacturing has been intensely investigated for surgical implants, tissue scaffolds, and organs. There is, however, scant literature to date that has investigated the viability of three-dimensional (3D) printing of surgical instruments.
Hurwitz, B. L., & Sullivan, M. B. (2013). The Pacific Ocean virome (POV): a marine viral metagenomic dataset and associated protein clusters for quantitative viral ecology. PloS one, 8(2), e57355.
More info
Bacteria and their viruses (phage) are fundamental drivers of many ecosystem processes including global biogeochemistry and horizontal gene transfer. While databases and resources for studying function in uncultured bacterial communities are relatively advanced, many fewer exist for their viral counterparts. The issue is largely technical in that the majority (often 90%) of viral sequences are functionally 'unknown' making viruses a virtually untapped resource of functional and physiological information. Here, we provide a community resource that organizes this unknown sequence space into 27 K high confidence protein clusters using 32 viral metagenomes from four biogeographic regions in the Pacific Ocean that vary by season, depth, and proximity to land, and include some of the first deep pelagic ocean viral metagenomes. These protein clusters more than double currently available viral protein clusters, including those from environmental datasets. Further, a protein cluster guided analysis of functional diversity revealed that richness decreased (i) from deep to surface waters, (ii) from winter to summer, (iii) and with distance from shore in surface waters only. These data provide a framework from which to draw on for future metadata-enabled functional inquiries of the vast viral unknown.
Hurwitz, B. L., Deng, L., Poulos, B. T., & Sullivan, M. B. (2013). Evaluation of methods to concentrate and purify ocean virus communities through comparative, replicated metagenomics. Environmental microbiology, 15(5), 1428-40.
More info
Viruses have global impact through mortality, nutrient cycling and horizontal gene transfer, yet their study is limited by complex methodologies with little validation. Here, we use triplicate metagenomes to compare common aquatic viral concentration and purification methods across four combinations as follows: (i) tangential flow filtration (TFF) and DNase + CsCl, (ii) FeCl3 precipitation and DNase, (iii) FeCl3 precipitation and DNase + CsCl and (iv) FeCl3 precipitation and DNase + sucrose. Taxonomic data (30% of reads) suggested that purification methods were statistically indistinguishable at any taxonomic level while concentration methods were significantly different at family and genus levels. Specifically, TFF-concentrated viral metagenomes had significantly fewer abundant viral types (Podoviridae and Phycodnaviridae) and more variability among Myoviridae than FeCl3 -precipitated viral metagenomes. More comprehensive analyses using protein clusters (66% of reads) and k-mers (100% of reads) showed 50-53% of these data were common to all four methods, and revealed trace bacterial DNA contamination in TFF-concentrated metagenomes and one of three replicates concentrated using FeCl3 and purified by DNase alone. Shared k-mer analyses also revealed that polymerases used in amplification impact the resulting metagenomes, with TaKaRa enriching for 'rare' reads relative to PfuTurbo. Together these results provide empirical data for making experimental design decisions in culture-independent viral ecology studies.
Hurwitz, B. L., Hallam, S. J., & Sullivan, M. B. (2013). Metabolic reprogramming by viruses in the sunlit and dark ocean. Genome biology, 14(11), R123.
More info
Marine ecosystem function is largely determined by matter and energy transformations mediated by microbial community interaction networks. Viral infection modulates network properties through mortality, gene transfer and metabolic reprogramming.
Hurwitz, B. L., Deng, L., Poulos, B. T., & Sullivan, M. B. (2012). Evaluation of methods to concentrate and purify ocean virus communities through comparative, replicated metagenomics. Environmental Microbiology, 15(5), 1428-1440. doi:10.1111/j.1462-2920.2012.02836.x
Degnan, P. H., Leonardo, T. E., Cass, B. N., Hurwitz, B., Stern, D., Gibbs, R. A., Richards, S., & Moran, N. A. (2010). Dynamics of genome evolution in facultative symbionts of aphids. Environmental microbiology, 12(8), 2060-9.
More info
Aphids are sap-feeding insects that host a range of bacterial endosymbionts including the obligate, nutritional mutualist Buchnera plus several bacteria that are not required for host survival. Among the latter, 'Candidatus Regiella insecticola' and 'Candidatus Hamiltonella defensa' are found in pea aphids and other hosts and have been shown to protect aphids from natural enemies. We have sequenced almost the entire genome of R. insecticola (2.07 Mbp) and compared it with the recently published genome of H. defensa (2.11 Mbp). Despite being sister species the two genomes are highly rearranged and the genomes only have ∼55% of genes in common. The functions encoded by the shared genes imply that the bacteria have similar metabolic capabilities, including only two essential amino acid biosynthetic pathways and active uptake mechanisms for the remaining eight, and similar capacities for host cell toxicity and invasion (type 3 secretion systems and RTX toxins). These observations, combined with high sequence divergence of orthologues, strongly suggest an ancient divergence after establishment of a symbiotic lifestyle. The divergence in gene sets and in genome architecture implies a history of rampant recombination and gene inactivation and the ongoing integration of mobile DNA (insertion sequence elements, prophage and plasmids).
Hurwitz, B. L., Kudrna, D., Yu, Y., Sebastian, A., Zuccolo, A., Jackson, S. A., Ware, D., Wing, R. A., & Stein, L. (2010). Rice structural variation: a comparative analysis of structural variation between rice and three of its closest relatives in the genus Oryza. The Plant journal : for cell and molecular biology, 63(6), 990-1003.
More info
Rapid progress in comparative genomics among the grasses has revealed similar gene content and order despite exceptional differences in chromosome size and number. Large- and small-scale genomic variations are of particular interest, especially among cultivated and wild species, as they encode rapidly evolving features that may be important in adaptation to particular environments. We present a genome-wide study of intermediate-sized structural variation (SV) among rice (Oryza sativa) and three of its closest relatives in the genus Oryza (Oryza nivara, Oryza rufipogon and Oryza glaberrima). We computationally identified regional expansions, contractions and inversions in the Oryza species genomes relative to O. sativa by combining data from paired-end clone alignments to the O. sativa reference genome and physical maps. A subset of the computational predictions was validated using a new approach for BAC size determination. The result was a confirmed catalog of 674 expansions (25-38 Mb) and 611 (4-19 Mb) contractions, and 140 putative inversions (14-19 Mb) between the three Oryza species and O. sativa. In the expanded regions unique to O. sativa we found enrichment in transposable elements (TEs): long terminal repeats (LTRs) were randomly located across the chromosomes, and their insertion times corresponded to the date of the A genome radiation. Also, rice-expanded regions contained an over-representation of single-copy genes related to defense factors in the environment. This catalog of confirmed SV in reference to O. sativa provides an entry point for future research in genome evolution, speciation, domestication and novel gene discovery.
Myles, S., Chia, J., Hurwitz, B., Simon, C., Zhong, G. Y., Buckler, E., & Ware, D. (2010). Rapid genomic characterization of the genus vitis. PloS one, 5(1), e8219.
More info
Next-generation sequencing technologies promise to dramatically accelerate the use of genetic information for crop improvement by facilitating the genetic mapping of agriculturally important phenotypes. The first step in optimizing the design of genetic mapping studies involves large-scale polymorphism discovery and a subsequent genome-wide assessment of the population structure and pattern of linkage disequilibrium (LD) in the species of interest. In the present study, we provide such an assessment for the grapevine (genus Vitis), the world's most economically important fruit crop. Reduced representation libraries (RRLs) from 17 grape DNA samples (10 cultivated V. vinifera and 7 wild Vitis species) were sequenced with sequencing-by-synthesis technology. We developed heuristic approaches for SNP calling, identified hundreds of thousands of SNPs and validated a subset of these SNPs on a 9K genotyping array. We demonstrate that the 9K SNP array provides sufficient resolution to distinguish among V. vinifera cultivars, between V. vinifera and wild Vitis species, and even among diverse wild Vitis species. We show that there is substantial sharing of polymorphism between V. vinifera and wild Vitis species and find that genetic relationships among V. vinifera cultivars agree well with their proposed geographic origins using principal components analysis (PCA). Levels of LD in the domesticated grapevine are low even at short ranges, but LD persists above background levels to 3 kb. While genotyping arrays are useful for assessing population structure and the decay of LD across large numbers of samples, we suggest that whole-genome sequencing will become the genotyping method of choice for genome-wide genetic mapping studies in high-diversity plant species. This study demonstrates that we can move quickly towards genome-wide studies of crop species using next-generation sequencing. Our study sets the stage for future work in other high diversity crop species, and provides a significant enhancement to current genetic resources available to the grapevine genetic community.
Cranston, K. A., Hurwitz, B., Ware, D., Stein, L., & Wing, R. A. (2009). Species trees from highly incongruent gene trees in rice. Systematic biology, 58(5), 489-500.
More info
Several methods have recently been developed to infer multilocus phylogenies by incorporating information from topological incongruence of the individual genes. In this study, we investigate 2 such methods, Bayesian concordance analysis and Bayesian estimation of species trees. Our test data are a collection of genes from cultivated rice (genus Oryza) and the most closely related wild species, generated using a high-throughput sequencing protocol and bioinformatics pipeline. Trees inferred from independent genes display levels of topological incongruence that far exceed that seen in previous data sets analyzed with these species tree methods. We identify differences in phylogenetic results between inference methods that incorporate gene tree incongruence. Finally, we discuss the challenges of scaling these analyses for data sets with thousands of gene trees and extensive levels of missing data.
Gore, M. A., Chia, J., Elshire, R. J., Sun, Q., Ersoz, E. S., Hurwitz, B. L., Peiffer, J. A., McMullen, M. D., Grills, G. S., Ross-Ibarra, J., Ware, D. H., & Buckler, E. S. (2009). A first-generation haplotype map of maize. Science (New York, N.Y.), 326(5956), 1115-7.
More info
Maize is an important crop species of high genetic diversity. We identified and genotyped several million sequence polymorphisms among 27 diverse maize inbred lines and discovered that the genome was characterized by highly divergent haplotypes and showed 10- to 30-fold variation in recombination rates. Most chromosomes have pericentromeric regions with highly suppressed recombination that appear to have influenced the effectiveness of selection during maize inbred development and may be a major component of heterosis. We found hundreds of selective sweeps and highly differentiated regions that probably contain loci that are key to geographic adaptation. This survey of genetic diversity provides a foundation for uniting breeding efforts across the world and for dissecting complex traits through genome-wide association studies.
Kim, H., Hurwitz, B., Yu, Y., Collura, K., Gill, N., SanMiguel, P., Mullikin, J. C., Maher, C., Nelson, W., Wissotski, M., Braidotti, M., Kudrna, D., Goicoechea, J. L., Stein, L., Ware, D., Jackson, S. A., Soderlund, C., & Wing, R. A. (2008). Construction, alignment and analysis of twelve framework physical maps that represent the ten genome types of the genus Oryza. Genome biology, 9(2), R45.
More info
We describe the establishment and analysis of a genus-wide comparative framework composed of 12 bacterial artificial chromosome fingerprint and end-sequenced physical maps representing the 10 genome types of Oryza aligned to the O. sativa ssp. japonica reference genome sequence. Over 932 Mb of end sequence was analyzed for repeats, simple sequence repeats, miRNA and single nucleotide variations, providing the most extensive analysis of Oryza sequence to date.
Liang, C., Jaiswal, P., Hebbard, C., Avraham, S., Buckler, E. S., Casstevens, T., Hurwitz, B., McCouch, S., Ni, J., Pujar, A., Ravenscroft, D., Ren, L., Spooner, W., Tecle, I., Thomason, J., Tung, C., Wei, X., Yap, I., Youens-Clark, K., , Ware, D., et al. (2008). Gramene: a growing plant comparative genomics resource. Nucleic acids research, 36(Database issue), D947-53.
More info
Gramene (www.gramene.org) is a curated resource for genetic, genomic and comparative genomics data for the major crop species, including rice, maize, wheat and many other plant (mainly grass) species. Gramene is an open-source project. All data and software are freely downloadable through the ftp site (ftp.gramene.org/pub/gramene) and available for use without restriction. Gramene's core data types include genome assembly and annotations, other DNA/mRNA sequences, genetic and physical maps/markers, genes, quantitative trait loci (QTLs), proteins, ontologies, literature and comparative mappings. Since our last NAR publication 2 years ago, we have updated these data types to include new datasets and new connections among them. Completely new features include rice pathways for functional annotation of rice genes; genetic diversity data from rice, maize and wheat to show genetic variations among different germplasms; large-scale genome comparisons among Oryza sativa and its wild relatives for evolutionary studies; and the creation of orthologous gene sets and phylogenetic trees among rice, Arabidopsis thaliana, maize, poplar and several animal species (for reference purpose). We have significantly improved the web interface in order to provide a more user-friendly browsing experience, including a dropdown navigation menu system, unified web page for markers, genes, QTLs and proteins, and enhanced quick search functions.
Hass-Jacobus, B. L., Futrell-Griggs, M., Abernathy, B., Westerman, R., Goicoechea, J., Stein, J., Klein, P., Hurwitz, B., Zhou, B., Rakhshan, F., Sanyal, A., Gill, N., Lin, J., Walling, J. G., Luo, M. Z., Ammiraju, J. S., Kudrna, D., Kim, H. R., Ware, D., , Wing, R. A., et al. (2006). Integration of hybridization-based markers (overgos) into physical maps for comparative and evolutionary explorations in the genus Oryza and in Sorghum. BMC genomics, 7, 199.
More info
With the completion of the genome sequence for rice (Oryza sativa L.), the focus of rice genomics research has shifted to the comparison of the rice genome with genomes of other species for gene cloning, breeding, and evolutionary studies. The genus Oryza includes 23 species that shared a common ancestor 8-10 million years ago making this an ideal model for investigations into the processes underlying domestication, as many of the Oryza species are still undergoing domestication. This study integrates high-throughput, hybridization-based markers with BAC end sequence and fingerprint data to construct physical maps of rice chromosome 1 orthologues in two wild Oryza species. Similar studies were undertaken in Sorghum bicolor, a species which diverged from cultivated rice 40-50 million years ago.

Proceedings Publications

Ponsero, A., Uren, J. M., Magain, N., Miadlikowska, J., Lutzoni, F., & Hurwitz, B. L. (2020, December). Metagenomic exploration of lichen thalli reveals novel viral communities. In American Geophysical Union Fall 2020.
Schreiber, L., Koskela, R., Kirkpatrick, C., Zanzerkia, E. E., Rubin, K. H., Hurwitz, B. L., Hamman, J., Pierce, S. A., Kosovichev, A. G., Asl, S. D., Tikoff, B., & Asl, S. D. (2019). EarthCube: A Community-Driven Cyberinfrastructure for the Geosciences–A Progress Report. In NSF Earthcube.
Choi, I., Ponsero, A., Youens-Clark, K., Bomhoff, M., Hurwitz, B. L., & Hartman, J. H. (2018, Spring). Libra: Improved Partitioning Strategies for Massive Comparative Metagenomics Analysis. In 9th Workshop on Scientific Cloud Computing.

Presentations

Hurwitz, B. L. (2023, Fall).
Benchmarking Phage Detection Tools for Metagenomics
. Microbiome Data Congress. Boston, MA: Microbiome Times.
Hurwitz, B. L. (2016, January). iMicrobe: a Cyberinfrastrucutre for Microbial Ecology. Plant Animal Genome. San Diego, CA: PAG.
Hurwitz, B. L. (2015, Fall). The New Molecular Microbiology: Impact on Understanding of Diabetic Foot Infections. ASM ICAAC. San Diego, CA: American Society for Microbiology.
Hurwitz, B. L. (2015, February). Big Data for Viral Ecology. American Society for Limnology and Oceanography Conference. Granada, Spain: American Society for Limnology and Oceanography.
Hurwitz, B. L. (2015, January). One Health Meet Big Data Analytics. Vet Sci Advisory Committee for Dean Burgess.
Hurwitz, B. L. (2015, May). The Microbiome Revolution: new tools, new diabetic foot ulcer concepts?. 7th International Symposium on the Diabetic Foot. The Hague, Netherlands: International Symposium on the Diabetic Foot.
Hurwitz, B. L. (2014, January). iMicrobe: Advancing Clinical and Environmental Microbial Research using the iPlant Cyberinfrastructure. Plant Animal Genome. San Diego, CA: PAG.
Hurwitz, B. L. (2014, March). Wound Infection: Moving from Pasteur to CSI; and Panelist: Infection, DemisToephi’d. DFCon. Los Angeles, CA: DFCon Limb Salvage.
Hurwitz, B. L. (2014, November). Metagenomics Meets Big Data Analytics. BME Departmental Seminar.
Hurwitz, B. L. (2014, October). The Battle of Infection: Implant related biofilm, can we define it, treat it, and prevent it?. Diabetic Limb Salvage Conference. Washington, DC: Diabetic Limb Salvage Conference.
Hurwitz, B. L. (2014, October). iMicrobe: A Cyberinfrastructure for Microbial Ecology. UA Plant Sciences Departmental Seminar.

Poster Presentations

Hurwitz, B. L. (2014, October). Biological Research Innovation through Dynamic Graph Engineering. Biological Data Science. Cold Spring Harbor, NY.

Edit my profile

Profiles search form

Bonnie L Hurwitz

Biography

Degrees

Work Experience

Awards

Licensure & Certification

Related Links

Interests

Teaching

Research

Courses

2025-26 Courses

Intro to BAT using CURE

Metagenomics

Metagenomics

2024-25 Courses

Applied Cyberinfrastruct Conc

Applied Cyberinfrastruct Conc

Applied Cyberinfrastruct Conc

Applied Cyberinfrastruct Conc

Biosystems Analytics

Biosystems Analytics

Biosystems Analytics

Dissertation

Dissertation

Independent Study

Thesis

Dissertation

Internship

Intro to BAT using CURE

Metagenomics

Metagenomics

Thesis

2023-24 Courses

Biosystems Analytics

Biosystems Analytics

Biosystems Analytics

Curr Top Plant Sci-Adv

Dissertation

Dissertation

Independent Study

Internship

Internship

Thesis

Dissertation

Metagenomics

Metagenomics

Thesis

Thesis

2022-23 Courses

Biosystems Analytics

Biosystems Analytics

Biosystems Analytics

Dissertation

Dissertation

Internship

Lab Presentations & Discussion

Dissertation

Dissertation

Intro to BAT using CURE

Lab Presentations & Discussion

Metagenomics

Metagenomics

2021-22 Courses

Dissertation

Dissertation

Lab Presentations & Discussion

Dissertation

Dissertation

Lab Presentations & Discussion

2020-21 Courses

Dissertation

Dissertation

Lab Presentations & Discussion

Dissertation

Dissertation

Independent Study

Lab Presentations & Discussion

Metagenomics