Abstract
Salmonella enterica and Escherichia coli are bacterial species that colonize different animal hosts with sub-types that can cause life-threatening infections in humans. Source attribution of zoonoses is an important goal for infection control as is identification of isolates in reservoir hosts that represent a threat to human health. In this study, host specificity and zoonotic potential were predicted using machine learning in which Support Vector Machine (SVM) classifiers were built based on predicted proteins from whole genome sequences. Analysis of over 1000 S.enterica genomes allowed the correct prediction (67 -90 % accuracy) of the source host for S. Typhimurium isolates and the same classifier could then differentiate the source host for alternative serovars such as S. Dublin. A key finding from both phylogeny and SVM methods was that the majority of isolates were assigned to host-specific sub-clusters and had high host-specific SVM scores. Moreover, only a minor subset of isolates had high probability scores for multiple hosts, indicating generalists with genetic content that may facilitate transition between hosts. The same approach correctly identified human versus bovine E. coli isolates (83 % accuracy) and the potential of the classifier to predict a zoonotic threat was demonstrated using E. coli O157. This research indicates marked host restriction for both S. enterica and E. coli, with only limited isolate subsets exhibiting host promiscuity by gene content. Machine learning can be successfully applied to interrogate source attribution of bacterial isolates and has the capacity to predict zoonotic potential.
| Original language | English |
|---|---|
| Article number | e000135 |
| Pages (from-to) | 1-10 |
| Number of pages | 10 |
| Journal | Microbial Genomics |
| Volume | 3 |
| Issue number | 10 |
| DOIs | |
| Publication status | Published - 1 Oct 2017 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- Support Vector Machine
- Salmonella
- Zoonosis
- Host specificity
- Machine learning
- E. coli
Fingerprint
Dive into the research topics of 'Patchy promiscuity: machine learning applied to predict the host specificity of Salmonella enterica and Escherichia coli'. Together they form a unique fingerprint.Research output
- 41 Citations
- 1 Article
-
Erratum: Patchy promiscuity: machine learning applied to predict the host specificity of Salmonella enterica and Escherichia coli
Lupolova, N., Dallman, T. J., Holden, N. J. & Gally, D. L. (Lead / Corresponding author), 6 Jun 2018, In: Microbial Genomics. 4, 6Research output: Contribution to journal › Article › peer-review
Open AccessFile22 Link opens in a new tab Citations (Scopus)193 Downloads (Pure)
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver