Previous Article | Next Article ![]()
Applied and Environmental Microbiology, December 2006, p. 7575-7585, Vol. 72, No. 12
0099-2240/06/$08.00+0 doi:10.1128/AEM.01174-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
,
Department of Food Science,1 Department of Population Medicine and Diagnostic Sciences, Cornell University, Ithaca, New York 14853,2 Wadsworth Center, New York State Department of Health, Albany, New York3
Received 21 May 2006/ Accepted 25 September 2006
|
|
|---|
|
|
|---|
Characterization of Salmonella isolates and human and animal salmonellosis surveillance have traditionally used serotyping for subtyping and strain differentiation (11). This technique relies upon the immunoreactivity of Salmonella O and H antigens. The O antigen is found in the lipopolysaccharide layer of the bacterial cell wall, whereas the H antigen is the filamentous portion of the flagella (11). Even though over 2,500 different Salmonella serotypes can be differentiated (11), some serotypes are commonly associated with human salmonellosis infections (e.g., Salmonella enterica serotype Typhimurium, S. enterica serotype Enteritidis, S. enterica serotype Newport, and S. enterica serotype Heidelberg) (11), limiting the value of serotyping for human disease surveillance. More discriminatory subtyping methods, such as phage typing (3, 32) and pulsed-field gel electrophoresis (PFGE), are thus commonly used to subtype Salmonella, particularly as part of national and international salmonellosis surveillance systems (8, 9, 24, 25, 50). These methods are generally used in conjunction with serotyping, since serotyping information can still provide valuable information about host-associated subtypes, the emergence of new subtypes, and historical trends in the association of specific Salmonella subtypes with different hosts species. Although serotyping, PFGE, and phage typing, particularly if used in combination, can provide a high level of subtype discrimination, none of these methods provides appropriate information to infer phylogenetic relationships among Salmonella isolates and subtypes. Multilocus sequence typing (MLST) is a subtyping method that determines the nucleotide sequences of full or partial housekeeping genes. Advantages of MLST include not only that the resulting DNA sequences data are nonambiguous and easily compared between laboratories, e.g., through large World Wide Web-based databases (24, 25, 50), but also that the DNA sequence data generated can be used to infer phylogenetic relationships among isolates, providing improved insight into the evolution and ecology of Salmonella subtypes. Even though MLST originally was defined as a sequencing-based subtyping approach that includes the sequencing of 450 to 600 nucleotide (nt) fragments for six to seven housekeeping genes (38, 51), Salmonella MLST schemes described in the literature have used sequencing of three to four genes (37, 49); some of these MLST schemes also included the sequencing of virulence or virulence-associated genes, e.g., spaM and fimA (23, 49). Although a seven-gene Salmonella MLST scheme is available through a WWW page maintained by the Max Plank Institute for Infection Biology in Berlin, Germany (6), we chose a previously described three-gene MLST, which was shown to provide discriminatory power similar to that of a seven-gene MLST (49), for the study reported here, because the sequencing of fewer genes provides a more economical subtyping approach.
The transmission of Salmonella from animal populations to humans represents a continuing concern (5), particularly because the incidence of human salmonellosis cases in the United States does not appear to have decreased substantially despite considerable efforts to reduce food-borne diseases (44). Whereas a variety of animal reservoirs (e.g., poultry, pigs, cattle, and reptiles) can serve as direct or indirect, food-borne sources for human Salmonella infections, our study reported here focused on probing the subtype diversity and epidemiology of human- and cow-associated Salmonella populations. Even though source attribution of sporadic salmonellosis cases is difficult, a number of human salmonellosis outbreaks have been linked to contaminated foods of bovine origin (milk or beef) (41). In addition, it has been suggested that antibiotic use in dairy cattle is selecting for multidrug-resistant (MDR) Salmonella strains, which may be transmitted to the human population via food (4, 13), as supported by the isolation of multidrug-resistant Salmonella strains from cattle (2, 36, 53) and human salmonellosis outbreaks linked to beef products contaminated with multidrug-resistant Salmonella strains (13, 15). Furthermore, direct contact between cattle and humans may also be involved in the transmission of Salmonella between these populations (33, 48). We assembled a set of 335 Salmonella strains isolated from humans and cattle in the same general region (New York state and Vermont) over the same time (2004) for characterization using serotyping and a three-gene MLST scheme in order to allow population-based comparisons of human- and cow-associated Salmonella subtypes and to evaluate the potential for cow-associated subtypes to be transmitted to humans. By comparison, a previous study by our group (49), which used isolates predominantly from domestic animals, reported the development of a three-gene MLST scheme but did not use this subtyping method to characterize the transmission of Salmonella between different host species.
|
|
|---|
Human and bovine Salmonella isolates were serotyped at the NYSDOH and the National Veterinary Services Laboratory (USDA-APHIS-VS, Ames, IA) using standard procedures (22).
PCR, DNA sequencing, and MLST.
The MLST scheme used here was based on PCR amplification and sequencing of three genes (manB, fimA, and mdh) as previously reported (49). As also previously described (49), the full fimA open reading frame (558 nt) and partial manB and mdh open reading frames (640 nt and 520 nt, respectively) were sequenced and used for allele assignments. While Salmonella lysates for PCR were initially prepared as previously described (49), preparation of purified Salmonella DNA using the QIAamp DNA Mini kit (QIAGEN Inc., Chatsworth, CA) replaced this lysate protocol to provide more consistent PCR results.
PCR amplification of manB, fimA, and mdh was performed essentially as previously described (2). All PCR products were purified using the QIAquick PCR purification kit (QIAGEN Inc.) and quantified using either a fluorescent DNA quantitation kit (Bio-Rad, Hercules, CA) or a Nanodrop spectrophotometer (NanoDrop Technologies Inc., Willmington, DE). Purified PCR products were sequenced by the Biotechnology Resource Center at Cornell University or by Macrogen Inc. (Geumcheon-gu, Seoul, Korea) as previously described (2). All sequences were assembled and proofread using SeqMan and aligned using the Clustal W algorithm in MegAlign (DNAStar, Madison, WI).
Allele assignments for individual genes were performed using DnaSP 4.0 (45); two sequences were assigned different allelic types if they differed by at least one nucleotide. Sequence types (STs) were assigned so that isolates that have identical allelic types for all three genes have the same ST. Allelic types and STs were assigned to be consistent with previous studies that used the same MLST scheme (2, 49), i.e., ST6 in the study reported here is identical to ST6 reported previously by Alcaine et al. (2).
Phylogenetic analyses.
Phylogenetic analyses were performed essentially as previously described (2). Briefly, neighbor-joining (NJ) and maximum likelihood (ML) trees were constructed for each gene using one representative sequence for each allelic type for a given gene. PAUP* 4.0b10 (Sinauer Associates, Sunderland, MA) was used to construct NJ and ML trees. MODELTEST (43) was used to find the most likely model of DNA substitution for a given gene for ML trees (i.e., TrNef+G for fimA and mdh and HKY+I+G for manB); NJ trees were constructed using the HKY85 nucleotide substitution model. Each tree was rooted with the manB, fimA, or mdh sequence for Escherichia coli O157:H7 (31), which served as the outgroup. Phylogenetic trees were constructed based on individual sequences only and were not based on concatenated sequences, since a number of manB sequences showed mixed bases due to the presence of two gene copies (49), preventing their inclusion in the phylogenetic analyses.
Statistical analyses.
The frequency distributions of serotypes and STs between human and bovine isolates were compared using the chi-square test of independence. For comparisons where one or more of the expected values were <5, Fisher's exact test was performed. P values of <0.05 were considered statistically significant. Chi-square and Fisher's exact tests were performed using SAS version 9.1 (SAS Institute Inc., Cary, NC). P values for exact tests for large contingency tables (e.g., 2 by 10) were determined by a Monte Carlo simulation using SAS version 9.1.
Simpson's index of discrimination (SID) was calculated as described previously by Hunter and Gaston (34); this index provides an indication of the discriminatory power of a given subtyping method (34) as well as an estimate of the subtype diversity within a given population (26). The 95% confidence intervals for SID were calculated as described previously by Grundmann et al. (26), and SIDs were considered significantly different if 95% confidence intervals did not overlap. All calculations were performed using Microsoft (Seattle, WA) Excel.
Among the bovine Salmonella isolates included in this study, multiple isolates obtained from the same farm (but different animals) at different sampling times showed the same serotypes and STs, indicating the reisolation of a persistent subtype on a given farm. Therefore, only one isolate representing each unique serotype/ST combination for a given farm was included in the summary statistics as well as in the chi-square test and SID calculations to avoid an overrepresentation of a subtype due to resampling. For example, while 20 Salmonella Newport isolates with ST11 were obtained from farm 510 (Table 1), only one of the isolates with this subtype combination from farm 510 was included in isolate numbers used for statistical analyses. When this approach was used to include only one isolate representing each unique serotype/ST combination found on a given farm, a total of 78 bovine isolates were obtained for inclusion in the statistical analyses reported here (i.e., see Table 2 through 5).
|
View this table: [in a new window] |
TABLE 1. Farms with multiple sample submission dates resulting in the isolation of Salmonella from clinically infected cattle
|
|
View this table: [in a new window] |
TABLE 2. Distribution of Salmonella serotypes among STs and human and bovine isolates
|
|
View this table: [in a new window] |
TABLE 5. Serotypes that include at least one isolate that carries a deletion in the 3' end of fimAa
|
|
|
|---|
MLST provides subtype discrimination of Salmonella that is only slightly more sensitive than serotyping.
The 335 Salmonella isolates included in this study could be differentiated into a total of 52 serotypes and 72 STs. fimA and mdh sequence data allowed the differentiation of 40 and 28 allelic types, respectively. Analysis of manB sequence data revealed a total of 33 isolates that showed reproducible double peaks at 1 to 15 nt positions, consistent with the presence of two copies of manB in these isolates, as previously reported for a smaller number of isolates (2). For each manB nucleotide position with a double peak, an International Union of Pure and Applied Chemistry (IUPAC) ambiguity nucleotide code indicating the presence of the two bases found (e.g., Y indicates the presence of C or T) was used to designate the final sequences for a given isolate. Isolates were assigned different manB allelic types if two isolates differed in their nucleotide sequences, including the ambiguous nucleotides (mixed bases), consistent with similar approaches that have been used to assign subtypes based on 16S rRNA gene sequence data that indicated the presence of multiple distinct 16S rRNA genes in a given organism (46). Using this approach, a total of 55 manB allelic types were differentiated. The numbers of polymorphic sites in manB, fimA, and mdh were 76, 260, and 85, respectively (calculated based on the allelic types reported here and in reference 49).
When serotype and MLST typing data were combined to assign overall subtypes (i.e., only isolates with the same serotype and ST were considered the same subtype), a total of 81 subtypes were differentiated. Analysis of combined serotype/ST-based subtypes allowed us to analyze the Salmonella diversity on the 20 farms from which Salmonella isolates were collected over multiple visits in 2004 (Table 1). On some farms, a given Salmonella subtype persisted over time; e.g., on one farm, an ST11 Salmonella serotype Newport strain was isolated from samples collected on 20 separate dates from 20 different animals (Table 1). These data showed the persistence of Salmonella subtypes over time in cattle on 18 farms, including the persistence of Salmonella serotype Newport ST11 on a total of 9 farms. We thus conclude that Salmonella serotype Newport ST11 may be characterized by its particular ability for persistence, consistent with a recent report that also showed the persistence of multidrug-resistant Salmonella serotype Newport strains on two farms in Washington state in the United States (18).
Among human isolates, a total of nine instances where two or three isolates with the same subtype (i.e., the same serotype and ST) were obtained from humans in the same county within <2 months of each other were observed. Whereas five of these instances are likely to represent small clusters, in the other four instances, a retrospective analysis showed that two or three isolates with the same ST had been obtained from the same patient. In one case, both isolates were obtained from the same patient on the same day; the time spans between obtaining the first and the last isolates for the other three patients were 2, 27, and 54 days, respectively (isolates were serotype Typhimurium, serotype 4,5,12:i:-, and serotype Abony, respectively).
A total of 17 serotypes included two or more STs, including the differentiation of six and five STs within isolates of serotype Newport and serotype Typhimurium. Among the 72 STs differentiated in this study, 7 STs included isolates representing multiple serotypes. These findings are consistent with previous studies, which also found the differentiation of multiple serotypes within a given ST (2, 49). The overall discriminatory abilities, as determined by SID, for serotyping and MLST were 0.917 and 0.920, respectively. The SID for combined serotype and MLST data was 0.943. Even though other subtyping methods, particularly PFGE, appear to provide even more sensitive subtype discrimination (23), we conclude that serotyping and MLST provide the appropriate subtype discrimination to evaluate the ecology and epidemiology of human and bovine Salmonella isolates, particularly since MLST data allow the phylogenetic analysis and definition of Salmonella clonal groups (49), which are defined here as groups of isolates that have closely related STs as determined by phylogenetic analyses and thus are presumed to have derived from a recent common ancestor.
Salmonella subtypes isolated from cattle and humans represent distinct and overlapping populations.
The Salmonella isolates characterized in our study represented 35 and 6 serotypes unique to human and bovine isolates, respectively, as well as 11 serotypes found among both host species. The most common human-associated serotypes were serotype Typhimurium, serotype Enteritidis, serotype Newport, serotype 4,5,12:i:-, and serotype Heidelberg, largely consistent with CDC data for 2003, which listed serotype Typhimurium, serotype Enteritidis, serotype Newport, serotype Heidelberg, and serotype Javiana as the five most commonly reported human Salmonella serotypes in the United States (11). This indicates that the human isolates included in our study are similar in their serotype composition to isolates associated with human clinical disease previously collected throughout the United States. Comparison of the frequency distributions of serotypes among human and bovine isolates using an overall chi-square analysis (with all serotypes that occurred less than five times grouped into a single category, termed "rare serotypes") showed that serotypes were not randomly distributed among human and bovine isolates (P < 0.001; Monte Carlo estimation of exact test). Pairwise chi-square tests (comparing the frequency of a given serotype to the frequency of all other serotypes) showed that serotype Enteritidis and serotype Heidelberg were overrepresented among human isolates, whereas serotype Newport and serotype Muenster were overrepresented among bovine isolates. Serotype Enteritidis and serotype Heidelberg were exclusive to human isolates, consistent with their well-recognized association with poultry and particularly chickens (11, 27, 47) and their rare presence in cattle (11, 27). In addition to being common among human isolates, serotype Typhimurium and serotype Newport were also the two most common bovine-associated serotypes, consistent with both U.S.-wide CDC data (11) and the fact that human salmonellosis outbreaks caused by these two serotypes have been linked to the consumption of contaminated beef and dairy products (12, 13, 42). While this finding provided evidence that cattle may be an important reservoir for these two serotypes, the sources of human Salmonella serotype Newport infections probably depend on the specific Salmonella serotype Newport clonal group, as discussed below. In addition to the chi-square test data, which showed that the individual serotype frequency differed among bovine and human isolates, SIDs also showed that human serotype diversity (SID = 0.931) was significantly higher than bovine serotype diversity (SID = 0.832), further supporting the hypothesis that distinct Salmonella populations are associated with these two host species.
MLST identified 56 and 6 STs that were unique to human and bovine isolates, respectively, as well as 10 STs that were found in both host species. A comparison of the frequency distributions of STs among human and bovine isolates using an overall chi-square analysis (with all STs that occurred less than five times being grouped into a single category, termed "rare STs") showed that STs were not randomly distributed among human and bovine isolates (P < 0.001; Monte Carlo estimation of exact test). Pairwise chi-square tests (comparing the frequency of a given ST to the frequency of all other STs) showed that two and three STs were overrepresented among isolates from humans (ST3 and ST14) and cattle (ST8, ST11, and ST44), respectively (Table 3), further supporting the finding that certain subtypes were not distributed evenly among human and cattle isolates. ST3 and ST14, which were overrepresented among human isolates, represent serotype Heidelberg and serotype 4,12:r:- (ST3) as well as serotype Enteritidis (ST14), consistent with the overrepresentation of these serotypes among human isolates as discussed above. Interestingly, ST8, which includes isolates classified as serotype Typhimurium and related serotypes (Table 3), was overrepresented among bovine isolates and was never isolated from human clinical cases, potentially indicating that this specific ST shows bovine host specificity. Alternatively, this ST may not yet have been transmitted to humans in significant numbers. ST11 and ST44, which were also more common among bovine isolates than human isolates, represented serotype Newport and serotype Bardo (ST11) as well as serotype Muenster (ST44), consistent with an overrepresentation of these serotypes among bovine isolates (Table 2). Thus, MLST data not only confirmed the host association of specific serotypes but also identified a bovine-associated ST within serotype Typhimurium, which could not have been identified based on serotype data alone.
|
View this table: [in a new window] |
TABLE 3. Distribution of Salmonella STs among human and bovine isolates and among different counties in New York state
|
A number of Salmonella STs, such as ST6, which includes emerging serotype 4,5,12:i:-, are geographically widespread among human and/or bovine populations.
Analysis of geographic source data for human and bovine isolates (Table 3) showed that a number of STs are distributed widely across New York state (Fig. 1), including ST3, ST14, and ST36, which were found only among human isolates, as well as ST8, which was found only among bovine isolates. ST6 and ST11, the two most common STs found in this study, were widely distributed among both human and bovine populations in New York state. Interestingly, ST6 not only contains Salmonella serotype Typhimurium isolates but also includes serotype 4,5,12:i:- and serotype 4,12:i:- (Table 3), indicating that these serotypes share a common ancestor, consistent with data from phylogenetic trees (Fig. 2), which grouped ST6 with other Salmonella serotype Typhimurium STs (ST7, ST8, ST47, and ST49). PFGE, random amplified polymorphic DNA, plasmid profile, and ribotyping-based studies on small isolate sets (<50 isolates) also previously concluded that serotype 4,5,12:i:- originated from a Salmonella serotype Typhimurium ancestor (19, 28). The ecology and epidemiology of Salmonella serotype 4,5,12:i:- are of particular interest, since it appears to represent an emerging human disease-associated Salmonella serotype (11). In 2003, serotype 4,5,12:i:- was identified as the 14th most common human Salmonella serotype in the United States (11), but due to difficulties in the classification of this serotype, many serotype 4,5,12:i:- isolates may have simply been reported as subspecies I, group B, and its prevalence may thus have been greater than previously reported (11). Serotype 4,5,12:i:- was also implicated in a food-borne salmonellosis outbreak in New York City in 2002 (1). The observation that this serotype was the fourth most common human-associated serotype in our study and that ST6, which includes serotype 4,5,12:i:- as well as other related serotypes (Table 3), is the most common human disease-associated and second most common cattle-associated ST (Table 3) indicates the public health importance of this specific ST, including the potential importance of bovine hosts as a reservoir for this ST.
![]() View larger version (17K): [in a new window] |
FIG. 1. Map of New York and Vermont counties where Salmonella isolates were obtained. The map was drawn using MapViewer 4.01 (Golden Sofware, Inc., Golden, CO). County codes are as follows: Alb, Albany; Brx, Bronx; Brm, Broome; Cat, Cattaraugus; Cay, Cayuga; Cha, Chautauqua; Cmg, Chemung; Cno, Chenango; Cvt, Chittenden, VT; Cli, Clinton; Cor, Cortland; Del, Delaware; Dut, Dutchess; Eri, Erie; Fra, Franklin; Fvt, Franklin, VT; Gen, Genesee; Kin, Kings; Lvt, Lamoille, VT; Lew, Lewis; Liv, Livingston; Mon, Monroe; Nas, Nassau; New, New York; Nia, Niagara; One, Oneida; Ono, Onondaga; Ont, Ontario; Ora, Orange; Orl, Orleans; Osw, Oswego; Ots, Otsego; Put, Putnam; Ren, Rensselaer; Sar, Saratoga; Sch, Schenectady; Sen, Seneca; Stl, St. Lawrence; Ste, Steuben; Suf, Suffolk; Tom, Tompkins; Uls, Ulster; Unk, Unkown; War, Warren; Was, Washington; Way, Wayne; Wes, Westchester; and Wyo, Wyoming.
|
![]() View larger version (17K): [in a new window] |
FIG. 2. Phylogenetic trees based on Salmonella fimA (A) and mdh (B) gene sequences. Neighbor-joining trees were built in PAUP* using one representative isolate for each sequence type, including STs previously described by Sukhnanand et al. (49), which were not represented among the isolates found here (these STs are indicated by a *). ML trees (not shown) displayed similar clusterings and topologies. Bootstrap analyses were performed using 5,000 replications, and bootstrap values >50 are indicated. M indicates STs representing isolates that contain a manB gene duplication; the branch labeled as "clade 1" represents a number of related STs, including STs (ST52 to ST57, ST63, and ST67 in the mdh tree) that correspond to isolates with two manB copies, indicating a common ancestral event that was responsible for the two manB copies found in these isolates. D1, D2, and D3 indicate STs representing isolates that contain fimA deletion types D1 through D3 (Table 6). STs representing the two distinct Salmonella serotype Newport groups (designated types A and B) are marked by circles or boxes and the respective type.
|
Duplication and deletion events in manB and fimA may provide a mechanism for rapid diversification of Salmonella surface molecules.
DNA sequence data for manB revealed the presence of two copies of manB in a total of 27 human and 6 bovine isolates (Table 4), as demonstrated by the presence of mixed bases at a number of nucleotide positions after PCR products were sequenced. This was also observed in a previous study where we reported and confirmed, using cloning and sequencing of manB PCR products, the presence of two manB alleles in three Salmonella serotype Montevideo isolates (49). Phylogenetic analyses of the duplicate manB genes found in these serotype Montevideo isolates showed that an additional manB copy had likely been introduced into an ancestral strain by horizontal gene transfer from a different Salmonella serotype similar to serotype Javiana (49). In the study reported here, we found evidence for the presence of two manB genes in 16 Salmonella serotypes (including serotype Montevideo) and in 19 STs (Table 4). fimA- and mdh-based phylogenetic trees (Fig. 2) showed that STs representing isolates with two manB copies represent a number of distinct subtypes, indicating that multiple independent horizontal gene transfer events (or duplication events) contributed to the presence of two manB copies in different Salmonella subtypes. While positive selection for the presence of multiple manB copies could account for this observation, expression of both manB copies would be necessary and needs to be established in future experiments. Several STs (ST52 to ST57, ST63, and ST67) that include isolates with two manB copies formed a similar cluster in fimA and mdh trees (designated clade 1 in Fig. 2). This may be the result of a common ancestral event (e.g., horizontal gene transfer or manB duplication) that was responsible for the two manB genes found in these isolates. Although the benefit of two distinct copies of manB in a single isolate remains to be determined, the fact that manB encodes phosphomannomutase, an enzyme that is part of the chemical pathway necessary to produce GDP-D-mannose, an important sugar subunit of the Salmonella O antigen (35), may suggest that the presence of two manB copies provides a possible mechanism to rapidly generate serotype diversity. This is consistent with the observation that isolates in clade 1 that carry two manB copies represented a total of six serotypes, including serotype Agbeni, serotype Montevideo, serotype Poona, serotype Oranienburg, serotype Rublislaw, and serotype Urbana. Further research, including the generation and characterization of manB deletion mutants, is clearly needed to better understand the potential contributions of multiple manB copies to generating Salmonella serotype diversity.
|
View this table: [in a new window] |
TABLE 4. Serotypes that include at least one isolate that carries two manB copiesa
|
|
View this table: [in a new window] |
TABLE 6. DNA and amino acid sequences of fimA deletion types D1, D2, and D3a
|
We thank R. Orsi, K. Nightingale, and H. Franks for help with data analyses.
Published ahead of print on 6 October 2006. ![]()
Supplemental material for this article may be found at http://aem.asm.org/. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»