|
|
||||||||
1 Biophysics Laboratories, St. Michael's Building, University of Portsmouth, White Swan Road, Portsmouth PO1 2DT, UK
2 Ludwig Institute of Cancer Research, UCL School of Medicine Branch, London W1P 8BT, UK
3 Department of Biochemistry and Molecular Biology, University College London, Gower Street, London WC1E 6BT, UK
Reprint requests to: Colyn Crane-Robinson, Biophysics Laboratories, St. Michael's Building, University of Portsmouth, White Swan Road, Portsmouth PO1 2DT, UK; e-mail: colyn.crane-robinson{at}port.ac.uk; fax 44 (0) 23 92 842053.
(RECEIVED August 1, 2000; FINAL REVISION October 23, 2000; ACCEPTED October 24, 2000)
4 Present address: RiboTargets Ltd, Granta Park, Abington, Cambridgeshire CB1 6GB, UK. ![]()
Article and publication are at www.proteinscience.org/cgi/doi/10.1110/ps.32801.
| Abstract |
|---|
|
|
|---|
90°.
Keywords: HMG box; LEF-1; TCF1
SRY; DNA bending
Abbreviations: NOESY, nuclear Overhauser enhancement and exchange spectroscopy HOHAHA, homonuclear Hartmann Hahn HMQC, heteronuclear multiple quantum coherence HSQC, heteronuclear single quantum coherence CD, circular dichroism DSC, differential scanning calorimetry.
| Introduction |
|---|
|
|
|---|
-chain enhancer. It is clear that the sequence-specific HMG box is a DNA-binding domain found in many transcription factors playing key developmental roles (for review, see Wegner 1999). A second class of HMG boxes that bind DNA are those that show no obvious DNA sequence-specific recognition but prefer to bind to preformed specific DNA structures such as the 4-way junction (Wright and Dixon 1988; Bianchi et al. 1989, 1992; Ferrari et al. 1992; Webb and Thomas 1999), DNA bulges (Payet et al. 1999), and cisplatin-modified DNA (Pil and Lippard 1992; Ohndorf et al. 1999). These non-sequence-specific HMG boxes frequently occur as multiple boxes and include those from the mammalian chromosomal proteins HMG1 and HMG2 (Johns 1982), Upstream Binding Factor UBF (Jantzen et al. 1990; Bazett-Jones et al. 1994), and mtTF1 (Parisi and Clayton 1991). The term ``architectural transcription factors'' has been applied to these HMG proteins (Groschedl et al. 1994; Wolffe 1994), to indicate their ability to manipulate the structure of the DNA to which they bind.
The structures of several HMG boxes have been determined for both the non-sequence-specific (Read et al. 1993; Weir et al. 1993; Jones et al. 1994; Hardman et al. 1995; Allain et al. 1999) and sequence-specific (van Houte et al. 1995) classes. Additionally, the structure of HMG boxes in complex with DNA is known, both non-sequence-specific (Allain et al. 1999; Murphy et al. 1999; Ohndorf et al. 1999) and sequence-specific (Love et al. 1995; Werner et al. 1995). In all cases the protein fold is observed to be a 3-helix bundle in the form of a somewhat asymmetric L-shape with helices 1 and 2 in one arm (the major wing) and most of helix 3 together with the most N-terminal residues in the other arm (the minor wing). The structures of the sequence-specific HMG box of hSRY bound to 8-bp DNA (Werner et al. 1995) and of mLEF-1 bound to 15-bp DNA (Love et al. 1995) showed that the L-shaped protein takes the minor groove of the DNA in a pincer-like grip and causes the DNA to bend away from the protein, toward the major groove, with considerable unwinding. DNA bending is brought about by a series of contacts on the inside of the L-shaped HMG box fold and in part by partial intercalation of a single hydrophobic sidechain between two adjacent adenines.
The mLEF-1/DNA complex additionally revealed that residues C-terminal to the minimal HMG box crossed over the major groove on the inside of the bend to make contact with the far end of the DNA (Love et al. 1995). This feature may explain the exceptionally large bend angle (130°) generated by mLEF-1 (Giese et al. 1992), shown to be dependent on the presence of a C-terminal extension (Lnenicek-Allen et al. 1996). In the model of the non-sequence-specific HMG box of NHP6A bound to DNA, a basic N-terminal extension of the folded domain wraps around the major groove (Allain et al. 1999).
Absent from our understanding of sequence-specific HMG box structures and their interactions with DNA is knowledge of the structure of the same HMG box both free in solution and bound to DNA, since the free-solution structures of neither hSRY nor mLEF-1 have been determined. This is a matter of particular interest since calorimetric measurements of the mSox-5 HMG box have shown that significant levels of protein refolding occur on association, in addition to the DNA bending (Privalov et al. 1999). The present study of mSox-5 represents the first part of a project to make a detailed comparison of the structure of a free and DNA-bound sequence-specific HMG box. We also take the opportunity to compare the structure of the mSox-5 HMG box with other HMG box structures both free and bound to DNA.
| Results |
|---|
|
|
|---|
H protons of residues 5, 19, 58, 75, and 78 nor the lysine C
H protons of residues 4, 15, 35, 42, 49, 61, 66, 71, 73, and 77. The assignment of the backbone 15N resonances was complete except for N30 and S31, for both of which the amide proton is in fast exchange with the solvent. The sidechain 15N resonances of all the asparagine, glutamine, and tryptophan residues were completely assigned, plus the 15N
resonances of residues R18, R19, and R40. The remaining sidechain nitrogen resonances could not be identified.
The NOE restraints
From the 2D and 3D NOESY spectra, a total of 1424 nonredundant distance restraints were established, of which 277 were intraresidue and 1147 were interresidue452 sequential (i, i + 1), 145 (i, i + 2), 159 (i, i + 3), 70 (i, i + 4) and 321 long range (i, i > 4). Figure 1
shows the distribution of NOE contacts in the mSox-5 HMG box. The (i, i + 3) and (i, i + 4) contacts run parallel to the diagonal and indicate the three
-helical regions F10 to F25, N32 to A43, and P51 to Y67. The location of helices can also be seen from Figure 2B
, which documents the runs of d
N (i, i + 3) and d
N (i, i + 4) cross-peaks that define these regions as being
-helical. This assignment of helices is supported by the Chemical Shift Index of the C
H protons (Fig. 2C
). Together these represent 45 out of 81 residues (55%) in an
-helical conformation, in good agreement with CD data (Connor et al. 1994). The position and length of the three
helices in mSox-5 are similar to those observed in other sequence-specific HMG box structures (Love et al. 1995; Werner et al. 1995).
|
|
A plot of the number of NOE restraints against position in the sequence (Fig. 2A
) shows that the most ill-defined regions correspond to P1H2, R5M7, N30S31, T45L47, and P74T79. Lack of definition in the R5M7 region was partly caused by chemical shift overlap, which also prevented observation of sequential amide NOEs between residues Y70K71 and Y72K73. The Chemical Shift Index (Fig. 2C
) suggests that residues 18 and 7379 are in an irregular conformation (CSI = 0).
The volume of peaks arising from each amide proton in the domain was measured in a 15N1H HSQC spectrum. A plot of the relative peak volumes (Fig. 2D
) shows that these are smaller for residues I3M7, H29N32, T45L47, and H63, in comparison with the remainder. This is probably because of fast solvent exchange of these amide protons and could account for the fewer NOEs observed in these regions. Measurement of NH proton cross-peak volumes at the NH and water chemical shift positions in 3D NOESY spectra confirmed this rapid exchange (see Fig. 2E,F
). Helix 3 (P51Y67) was found to contain slowly exchanging amide protons up to residue L59, beyond which more rapid exchange occurs.
3 JHN
coupling data
Coupling constants were measured at two separate temperatures (12° and 25°C, Fig. 3A,B
). A series of values between 2.5 and 5.0 Hz was observed in the helical regions, as expected. However, in helix 1, 3JHN
values of 6.5 Hz (at 12°C) and 7.5 Hz (at 25°C) were obtained for residue D16, which suggests a break or kink in the helix at this position. In helix 3 it is notable that the 3JHN
values steadily increase from 3.0 to 7.0 Hz between residues Q56 and Y67, suggesting a gradual loosening or stretching toward the C-terminal end of the helix. Regions with 3JHN
values generally greater than 6.0 Hz at 25°C are P1N8, A24N32, A43K49, and K66T79. A lower temperature of 12°C did not significantly affect these values in regions A43K49 and Y67T79. However, 3JHN
coupling constants for the C-terminal end of helix 3 (S60K66) and for the N-terminal residues K4, R5, and M7 are reduced at 12°C by 0.5 to 2.0 Hz (Fig. 3C
). This reduction represents an increase in structural order in this region as the temperature is lowered. Increased structural order at lower temperatures has previously been observed for the mSox-5 HMG box using CD and DSC (Crane-Robinson et al. 1998).
|
i) for fast (picosecond) internal motion, and where necessary a factor Rex introduced for a good fit. The Rex term may be taken to indicate a contribution to the transverse relaxation rate arising from slow (milli- to microsecond) interconversion processes (Clore et al. 1990a) or from specific self-association phenomena (Pfuhl et al. 1999). Where necessary for a good fit an additional order parameter (Sf2) was introduced, which may be taken to indicate motion on an intermediate time-scale (Clore et al. 1990b; Mandel et al. 1995). For the most important parameter to assess fast internal motion, a limiting value of S2 = 1 indicates total restriction and S2 = 0 would indicate no restriction of the internal motion of the amide NH bond.
|
Helices 1 (F10F25) and 2 (N32A43) have mean S2 values of 0.85 and 0.86, with mean {1H}15N NOE values of 0.67 and 0.68, respectively. However helix 3 (residues P51Y67) exhibits mean S2 and {1H}15N NOE values of only 0.78 and 0.52, respectively. These lower values, as compared to the rest of the domain, arise from a series of low S2 and {1H}15N NOE values for residues Q62 to Y67, in the C-terminal end of helix 3. Calculation of the mean S2 and {1H}15N NOE for just the N-terminal end of helix 3 (residues P51K61) yielded values of 0.85 and 0.64, values similar to that found for helices 1 and 2. This indicates that helices 1 and 2 and the N-terminal segment of helix 3 do not display fast internal motion, whereas the six residues at the C-terminal end of helix 3 show a degree of dynamic disorder.
In addition, helix 1 exhibits a trend toward lower S2 values along the helix, suggesting the N-terminal end of the helix has a slightly greater degree of dynamic order than the C-terminal end. Isolated low values occurred near the ends of helices 1 and 2 at residues F25, I33, and W41. For loop 1 slightly low values for the mean S2 and {1H}15N NOE were observed (0.81 and 0.61, respectively), although this was the not case for loop 2 (0.85 and 0.67, respectively). This could suggest that the degree of dynamic order in the loops is as great as in the helices.
The derived Rex terms (data not shown) were widely dispersed throughout mSox-5 and were all small in value (<3.30 sec-1) with the exception of residues L22 (Rex = 4.64) and K49 (Rex = 8.79). Since no concentration dependence on chemical shifts in the 15N1H HSQC spectra was observed, this would argue against self-association and implies that slow conformational interconversion processes occur at residues L22 and K49, in loops 1 and 2, respectively.
Determination of structures
Structures were initially determined using a restraint file for the complete 79-residue mSox-5 HMG box. Families of consistent structures were clearly observed, but these structures showed that beyond residue 70 there was a high degree of conformational variability. Further sets of structures were therefore obtained using a restraint file for residues 170. From 50 calculated structures, the 30 of lowest energy (<1300 kcal mole-1) were selected and further refined in the presence of selected hydrogen bond and dihedral angle restraints, which were based on hard-to-exchange amide proton data (Fig. 2E,F
) and 3JHN
coupling-constant data (Fig. 3
), respectively. After the final minimization, all selected structures contained no distance restraint violations greater than 0.5 Å for both backbone and sidechain distance restraints and no dihedral angle violations greater than 10°.
Deviation from a standard
helix was found within helix 1. The 3JHN
coupling constant for D16 is 7.5 ± 0.5 Hz (Fig. 3A
), a value compatible with a phi angle of -90 ± 10°. Furthermore, the d
N(i, i + 4) NOE between residues V12 and D16 was absent from the NOESY spectra. Taken altogether, this indicates that the hydrogen bond between the carbonyl O of V12 and the NH of D16 is very extended and nonlinear, leading to distortion of helix 1 at this point. This hydrogen bond restraint was therefore not included in the final modeling.
In the C-terminal part of helix 3 (H63Y67), the 3JHN
values for K66 and Y67 are 7.2 and 8.2 Hz, respectively, and all the d
N(i, i + 3) and d
N(i, i + 4) NOE cross-peaks were very weak. Furthermore, an increase in exchange volumes of the amide protons of residues 6067 at the water chemical shift position was observed (Fig. 2E
). The more mobile C-terminal end of helix 3 is also manifested in the longer T2 and lower S2 values for residues 6267 (Fig. 4B,D
). Hydrogen bond restraints were therefore applied only to residues E54Q62 of helix 3.
Residues 170 were further examined for multiconformational states using the Xplor v3.851 ensemble program for cross-validation of structures (Bovin and Brunger 1995,1996). Twenty structures were collected, and the averages were determined for the existence of 1, 2, or 3 conformers. The number of violations greater than 0.2 Å were 38.1, 2.3, and 1.3 (with standard deviations of 3.1, 2.0, and 1.3, respectively) for nonrefined 1-, 2-, and 3-conformer models, respectively. The considerable reduction in violations with increasing number of possible conformers, in particular from 1 to 2, could result either from the fact that there are genuinely two conformers or because in a fold having significant flexibility in parts of the structure, an increase in the number of allowed conformers inevitably leads to fewer violations. We therefore compared the two averaged conformers in the 2-conformer model for clear signs of differences at particular points. In loop 2 there were no significant conformational differences, but in loop 1 the two conformers were different. To decide if this was caused by flexibility or the presence of two genuine conformers, we compared the 3 structures of loop 1 in the 3-conformer model: The third average conformer was intermediate between the first two, and we conclude that loop 1 is indeed flexible. This conclusion accords with the finding that the amide NHs of N30 and S31 are in very rapid solvent exchange (Fig. 2F
).
Description of the mSox-5 HMG box structure
Figure 5A
shows a stereo backbone view of the 30 final structures comprising residues 170 of the mSox-5 HMG box. The structures were overlaid by best-fit superposition of the backbone heavy atoms (amide N, C
, carbonyl C and O) of residues F10F25 and N32K42. These two regions, comprising helices 1 and 2, were chosen for overlay since they were the best defined (number of NOEs per residue >21, with an average of 58) and consistent in position relative to each other, that is, low average pairwise RMSDs. The average pairwise RMSD for the heavy backbone atoms in this set was 0.27 ± 0.13 Å. Superposition of residues F10F25, N32K42, and P51Y67 gave a pairwise RMSD of 1.56 ± 0.90 Å, whereas superposition of all heavy backbone atoms for residues 167 gave a pairwise RMSD of 2.3 ± 1.7 Å.
|
1.0 Å larger than expected for a canonical turn. The following two residues, N30 and S31, are in an extended conformation. In loop 2, M44L47 are in a partially extended conformation, and the following 4 residues, E48P51, have an approximately
-helical conformation with a similar set of NOE contacts to the equivalent residues in HMG1 box 2 (Read et al. 1993; Weir et al. 1993; Read et al. 1995). The minor wing is the long arm of the fold and consists of the extended N-terminal segment P1N8 running alongside and antiparallel to most of helix 3 (E54Y67).
Figure 5B
shows a view of the major wing, highlighting selected sidechains in all of the 30 final structures. A consistent hydrophobic core (F10, W13, L37, W41, M44, and Y52) is seen with the main apolar contacts having a well-defined geometry. In contrast, the sidechains of M11, N30, and S31 point out into the solvent and show no fixed conformation.
The residues involved in forming the apolar core maintain the orientation of the surrounding three
helices (Fig. 5B
). The aromatic rings of F10, W13, and W41 stack onto one another and orient helices 1 and 2. Residues L22 and L37 form hydrophobic contacts between helix 1 and the ß turn of P26H29 and between helices 1 and 2, respectively. The other aromatic residue, F25, makes close contact with the methyl groups of M28 and I21: These serve to maintain the angle between helix 1 and helix 2, as well as stabilizing the ß turn between helices 1 and 2. The orientation of helix 1 to helix 3 depends on apolar contact between the methyl group of A9 and the sidechain of Y52 in helix 3.
In the minor wing, NOEs were observed between the sidechains of L59, H63, and L64 and the backbone residues of M7/N8, R5, and K4, respectively. There were also weak NOEs between the sidechains of H63 and Y67 and the sidechain of I3. These fix the position of the N-terminal segment of the box alongside helix 3. At the C-terminal end of helix 3, residues Y67Y70 form a type III ß turn similar to that found in loop 1. In order to better assess the degree of local order in the minor wing, superpositions were made of residues 5167 (helix 3) and residues 19 in the N-terminal segment. Figure 5C
shows that helix 3 exhibits considerable regularity, whereas the nine N-terminal residues are significantly less ordered but more ordered than appears to be the case from the helix 1/2 superpositions of Fig. 5A
. The disorder in the central part of the N-terminal segment, R5M7, is owing to a lack of NOEs to residue P6, which in turn is caused by chemical shift overlap. However, the apparent decrease in structural order from residues 9 to 1 (Fig. 5C
) is in good accord with the reduction in S2 shown in Figure 4D
.
Conformation of the C-terminal segment
Since a regular NOE intensity pattern was observed for residues lying beyond helix 3, P68R75, that is, medium/strong dNN, d
N(i, i), very strong d
N(i, i + 1), and very weak d
N(i, i + 2), a new set of structures were calculated that included restraints observed for residues 7179. The conformation of the C-terminal region, including and beyond helix 3, was then examined using the following superpositions: 5167, 5177, and 6877. This procedure is similar to that carried out for rHMG1 box 2 (Weir et al. 1993) and for mSox-4 (van Houte et al. 1995). From 50 calculated structures, 30 low-energy structures were selected and refined. The final structures when overlaid by superposition of all backbone heavy atoms for residues 5177 (Fig. 6A
) gave a pairwise RMSD of 3.78 ± 1.0 Å, whereas superposition utilizing only helix 3 (P51Y67, Fig. 6B
) gave an RMSD of 0.99 ± 0.30 Å, a value typical for an ordered structure with some variability. The large difference between these two RMSD values indicates that residues beyond the C-terminal end of helix 3 do not have a particularly fixed position. In order to see if there is any regular structure at all in the region beyond helix 3, residues 6877 were superimposed (Fig. 6C
). The backbone heavy atom RMSD for this region was 2.9 ± 1.1 Å. Figure 6C
shows that the overall conformational envelope of these C-terminal residues (6879) is a hooked shape with a poorly defined turn at P68 (the exit to helix 3) and a second turn centred around P74.
|
| Discussion |
|---|
|
|
|---|
It is striking that superposition of the various HMG box folds using the heavy backbone atoms of helices 1 and 2 (residues 1025 and 3242) yields little variation, with the RMSD values ranging only between 1.00 and 1.72 Å (Table 1
, column 2). When helix 3 (residues 5267 in mSox-5) is included in the superposition, the RMSD values rise to between 1.30 and 3.12 Å (Table 1
, column 4), indicating that the position of helix 3 in the minor wing is not well defined relative to the major wing (which includes helices 1 and 2). Removal of the last 5 residues of helix 3 from the superpositions, that is, using only residues 5262 for helix 3, significantly reduced the RMSD values for some of the boxes (Table 1
, column 3) and reflects the increased uncertainties in the fold between residues 62 and 67.
|
Figure 7A
shows the hydrophobic core for all 6 HMG box folds determined in the absence of DNA, and it is seen that the relative positions of the 4 principal aromatic sidechains (residues 10, 13, 41, and 52) are the same in all 6 folds. Figure 7B
shows the hydrophobic core for the five HMG box/DNA structures (together with the free-solution mSox-5 structure). For these structures, again, the 4 principal aromatic sidechains are in similar orientations, except for hSRY/DNA, for which the positions of the W41 and W13 sidechains are inverted such that it is the sidechain of W13 that contacts residues F52 and F53, rather than the sidechain of W41. A detailed comparison of the conserved aromatic sidechains in the hydrophobic core was made by superimposing them onto those of mSox-5. The pairwise RMSD values obtained for the Cß and C
atoms (Table 1
, column 9) and for all sidechain heavy atoms (Table 1
, column 10) in the 10 HMG box structures do not differ greatly, with the notable exception of hSRY/DNA. This divergence of hSRY is unexpected because the amino acid sequence of mSox-5 is closer to hSRY than to any of the other 9 structures, and there are considerable differences between the sequence of mSox-5 and the non-sequence-specific HMG boxes.
|
The primacy of the minor wing in establishing DNA sequence specificity was shown by a subdomain swap experiment using hLEF-1 and HMG1 box 2 (Read et al. 1994), and this was borne out by the structures of the two sequence-specific HMG box/DNA complexes (Love et al. 1995; Werner et al. 1995). Several amino acids in the minor wing have been singled out as critical for sequence-specific DNA binding: N8 is involved in hydrogen bonding and electrostatic interactions with three bases and forms the stem of an amino acid wedge that forces intercalation of the sidechain of residue 11 between two adenine rings (Werner et al. 1996), and it is likely that this mechanism operates for mSox-5/DNA binding and involves N8W13, with M11 as the intercalating residue. In non-sequence-specific HMG boxes, residue 8 is normally serine and never asparagine. At position 11 in sequence-specific boxes the residue is either methionine or isoleucine (occasionally phenylalanine), and in non-sequence-specific boxes it is one of several large hydrophobics, with the notable exception of box 1 of mammalian HMG1 and HMG2, in which it is alanine. Indeed, this residue was found not to be intercalated into the cisplatinated DNA of the complex with rHMG1 (Ohndorf et al. 1999). It seems therefore that for most non-sequence-specific HMG boxes there are two DNA-intercalating residues but only one for sequence-specific boxes.
Residues V3, Y67, and Y70 in hSRY pack together so as to present a precise surface of the N-terminal region to the DNA, which appears critical for DNA sequence-specific recognition (Werner et al. 1995, 1996): In DNA non-sequence-specific boxes, residue 3 is normally proline (Read et al. 1995) and never valine or isoleucine (as in mLEF-1). For this hydrophobic cluster of 3 sidechains to form it appears necessary for the backbone to turn through an approximate right angle at the end of helix 3, a conformation very evident in the structures of hSRY and mLEF-1 bound to DNA (Love et al. 1995; Werner et al. 1995). A proline residue (P68 in mSox-5) seems essential for this change in backbone direction, and proline is conserved at this position in all sequence-specific HMG boxes but absent from all non-sequence-specific boxes (Ner 1992). In free-solution mSox-5, P68 is part of a turn (Y67Y70) that also results in a change of chain direction by
90° (Fig. 6
).
The minor wing of the mSox-5 fold
The structural independence of the minor wing of the HMG box fold from its major wing is indicated by several criteria. In our early studies of the domain structure of HMG1 using limited trypsin digestion (Cary et al. 1983) it was found that box 1 was preferentially cut at the C-terminal sides of residues 6 and 60, positions now seen to correspond to the boundary between the two wings. Importantly, the remaining major wing fragment was shown to be fully folded, demonstrating that the major wing does not require the minor wing in order to fold (Cary et al. 1983). In later experiments we showed that it was possible to construct a folded chimeric HMG box from the major wing of HMG1 box 2 and the minor wing of hLEF-1 that preserved the sequence-specific DNA-binding characteristics of hLEF-1 (Read et al. 1994).
A difference between mSox-5 and non-sequence-specific boxes in free solution is the increased flexibility in helix 3 of mSox-5. The relaxation data in Figure 4
show a steady decrease in the order parameter S2 beyond residue Q62, 6 residues before the end of helix 3. In marked contrast, the equivalent residues in helix 3 of non-sequence-specific boxes (both boxes from HMG1 [Broadhurst et al. 1995] and dHMG-D [Jones et al. 1994]) showed no increase in dynamic disorder. This might be related to a requirement for a higher level of induced fit in the binding of sequence-specific HMG boxes to DNA. The importance of protein refolding in DNA sequence recognition has recently been discussed (Wright and Dyson 1999).
A DSC and CD study demonstrated that the mSox-5 HMG box denatures as two separate subdomains. The lower melting subdomain was assigned to the minor wing on the basis of changes in the intrinsic fluorescence and NMR spectrum (Crane-Robinson et al. 1998). DSC and CD melting studies of HMG1 box 2 also show the presence of two subdomains, but the stability of the lower melting minor wing is substantially greater than for mSox-5 (P.D. Cary, C.M. Read, C. Crane-Robinson and P.L. Privalov, unpubl.). Deconvolution of the calorimetric Cp/T function observed for mSox-5 suggested that the melting of the minor wing is a cooperative transition with a Tm of 34°C that partially overlaps the melting of the major wing (Tm = 46°C). The relaxation measurements obtained at 25°C (Fig. 4
) are revealing in this respect. If a subdomain denatures cooperatively in a two-state process, then at some defined intermediate temperature one would expect the mobility of all residues in the subdomain to be the same and, in a fast exchange situation, to reflect the relative proportions of the native and denatured states at that temperature. But that is not what is seen in Figure 4
for the minor wing (residues 54 to the C-terminus and 18): The degree of dynamic order of residues 5461 is high, but that for the succeeding C-terminal residues gradually decreases. The degree of dynamic order of the first 8 N-terminal residues also varies with its position in the chain. This is what would be expected if temperature increase resulted in a gradual unfolding of the minor wing (from the N- and C-terminal ends) rather than a cooperative melting of the whole wing. Relaxation measurements at several temperatures would be required to absolutely prove this point, but such considerable differences in mobility along the minor wing measured at a single temperature leave little doubt that its melting is a continuous process.
| Materials and methods |
|---|
|
|
|---|
The domain studied represents an 81-residue peptide having two additional amino acids (GlySer) at the N terminus of the 79-residue HMG box of mSox-5 (for amino acid sequence, see Fig. 1
). The purified recombinant mSox-5 HMG box showed a single band on both SDS and acetic acid/urea polyacrylamide gels. Electrospray mass spectrometry gave the expected mass of 9804.6 Da. CD spectroscopy of the refolded mSox-5 HMG box indicated an
-helical content of
55%. Gel retardation studies showed a band of reduced electrophoretic mobility on binding the protein to a 12-bp DNA duplex containing the recognition sequence 5'-AACAAT-3', and a circular permutation assay gave a bend angle of 70° (Privalov et al. 1999).
NMR spectroscopy
Two- and three-dimensional NMR spectra were obtained on a home-built General Electrics Omega 600-MHz spectrometer (OCMS, University of Oxford) in 90% H2O/10% 2H2O at 25°C, pH 6.2. Further 2D NMR spectra were obtained at 25°C and 32°C at pH values of 6.2 and 5.4 in 90% H2O/10% 2H2O and in 100% 2H2O in order to resolve problems of cross-peak overlap. The spectra obtained at pH 5.4 were recorded on a Bruker AM600 at Oxford University. Protein concentrations of
3 mM and 2 mM were used for the 2D and 3D spectra, respectively, in 75 mM potassium phosphate, 0.5 mM DTT. 1H1H NOESY spectra (Jeener et al. 1979; Kumar et al. 1980; Kieffer et al. 1994) were collected in the phase-sensitive manner by the time proportional increments method, with mixing times of 120 msec (pH 6.2) and 130 msec (25°C, pH 5.4)/160 msec (32°C, pH 5.4). 1H1H HOHAHA spectra (Braunschweiler et al. 1983; Davis and Bax 1985; Bax et al. 1987; Kieffer et al. 1994) were collected with a mixing time of 32 msec (pH 6.2) and 35 or 40 msec (pH 5.4), as previously described. Care was taken to optimize baseline flatness in the spectra by appropriate choice of the initial t1 and preacquisition delays (Marion and Bax 1988; Bax et al. 1991). The solvent signal was removed in the F2 dimension by time domain deconvolution (Marion et al. 1989b). Three-dimensional 1H NOESY 15N1H HMQC and 1H HOHAHA 15N1H HMQC experiments (Marion et al. 1989a; Messerle et al. 1989; Driscoll et al. 1990) were recorded with 120-msec and 40-msec mixing times, respectively, using 128 x 32 x 512 complex points with spectral widths of 6 kHz (F1), 2 kHz (F2), and 12 kHz (F3). NMR data were processed using the program Felix v2.30 (MSI/Biosym Technologies) and displayed using NMRview (Johnson and Blevins 1994).
Further NMR data were acquired on a Varian UnityPlus 500-MHz spectrometer at University College London. 3JHN
coupling constants, and their temperature dependence, were determined from 15N1H HMQC-J spectra sequentially recorded at 12°C and 25°C. 3JHN
coupling constants were evaluated with a nonlinear least-squares curve fit to the observed peaks using an analytical expression for the cross-peak lineshape, including the effects of antiphase dispersive character, cross-correlation, and apodization, according to previously described procedures (Norwood et al. 1992). The 15N1H HMQC-J experiments were also used to define the solvent exchange rate of backbone amide NH protons.
The 15N T1, T2, and {1H}15N heteronuclear NOE data (Kay et al. 1989) were recorded with a 1 mM mSox-5 sample at 25°C, at a 15N frequency of 50.6 MHz. The data were analyzed using the LipariSzabo model-free formalism (Lipari and Szabo 1982a, b) according to the procedures detailed in Pfuhl et al. (1999) and Kristensen et al. (2000) and are essentially identical to those of Palmer and co-workers (Mandel et al. 1995). The 15N T1 and T2 values were estimated by fitting the heights from any given NH peak to a two-parameter exponential decay function. Steady-state {1H}15N NOE values were calculated as NOE =
sat/
unsat, where
sat and
unsat are the average signal heights in the presence and absence of 1H presaturation, respectively. All spectra were processed with Felix v2.30, and signals were integrated with Xeasy (Bartels et al. 1995). The uncertainties of the signal heights were estimated from the rms noise level in the spectra. Uncertainties in T1 and T2 times were estimated from Monte Carlo simulation, and uncertainties of the steady-state {1H}15N NOE were calculated by error propagation.
Structure calculations
Calibration of NOE-derived distances was based on known interatomic spacings within aromatic rings (F10, W13, F25, W41, and Y52) and checked using the well-defined
helices (averaged distances for i to i + 1, i + 2, i + 3, and i + 4). All assigned NOE cross-peaks were then classified as either strong, medium, or weak and for the 3D spectra the distance restraints for the 3 categories were set at 1.82.7, 1.83.5, and 1.85.0 Å, respectively. Since a number of 2D spectra involved mixing times of 120160 msec, the possibility of spin diffusion effects meant that these spectra had to be recalibrated. It was found, for example, in the case of 2D spectra recorded in 2H2O, that interactions of up to 7.0 Å could be observed; accordingly, distance categories of 1.82.7, 1.83.5, 1.85.5, and 1.87.0 Å were applied to these spectra. Stereospecific assignment (Gronenborn et al. 1991) of the methyl protons of V12, L22, and L39 was achieved from 2D NOESY data in which clear chemical shift differences were observed in conjunction with differential intensities.
Xplor v3.851 (Brunger 1993) was used to calculate structures from an initial pseudorandom coordinate template file. Distance NOE restraint files were written as described by Wuthrich et al. (1983), and redundant NOE restraints were eliminated with Aqua v2.0 (Laskowski et al. 1996). Hydrogen bonds and dihedral angles within expected helices were initially set on the basis of: (1) hard-to-exchange backbone NH protons [NOE volumes measured at the NH (15N1H HSQC) and NH and H2O (3D NOESY) chemical shift positions]; (2) continuous stretches of d
N(i, i + 3) and d
N(i, i + 4) NOE cross-peaks; (3) C
H proton chemical shifts (Wishart et al. 1992); and (4) 3JHN
coupling constants. Restraints for phi dihedral angles were determined using the equation 3JHN
= 6.98 cos2
- 1.38 cos
+ 1.72. Psi dihedral angles were not restrained. This led to the use of 1424 (1383) distant NOE restraints, 68 (61) dihedral restraints, and 24 (24) pairs of hydrogen bond restraints, on structures modeled for residues 179 and 170, respectively.
Simulated annealing was carried out using center averaging for pseudo atom positions using the soft potential for all initial calculations, and these were refined until all NOE restraints, including hydrogen bond violations, were less than 0.7 Å for all structures. Low-energy structures selected from these were further refined to produce a consistent family of structures with distance NOE violations less than 0.5 Å. Structures with energies less than 1300 kcal mole-1 were then refined, from which 30 were selected having energies less than 500 kcal mole-1. Multiconformer analysis of the 170-residue domain was carried out using Xplor v3.851 with the lowest energy refined structure as initial template and the same restraint files as used previously. Ensembles of 20 structures were examined for each conformer using a violation setting of 0.2 Å, for a maximum of 3 conformer ensembles.
| Acknowledgments |
|---|
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked ``advertisement'' in accordance with 18 USC section 1734 solely to indicate this fact.
| References |
|---|
|
|
|---|
Bartels, C., Xia, T.H., Billeter, M., Guntert, P., and Wuthrich, K. 1995. The program XEASY for computer-supported NMR spectral analysis of biological macromolecules. J. Biomol. NMR 5: 110.[CrossRef][Medline]
Bax, A., Sklenar, V., Clore, G.M., and Gronenborn, A.M. 1987. Water suppression in two-dimensional spin-locked nuclear resonance experiments using a novel phase-cycling procedure. J. Amer. Chem. Soc. 109: 65116513.[CrossRef]
Bax, A., Ikura, M., Kay, L.E., and Zhu, G. 1991. Removal of F1-base-line distortion and optimisation of folding in multidimensional NMR-spectra. J. Magn. Reson. 91: 174178.
Bazett-Jones, D.P., Leblanc, B., Hertfort, M., and Moss, T. 1994. Short-range DNA looping by the Xenopus HMG-box transcription factor, xUBF. Science 264: 11341137.
Bianchi, M.E., Beltrame, M., and Paonessa, G. 1989. Specific recognition of cruciform DNA by nuclear protein HMG1. Science 243: 10561059.
Bianchi, M.E., Falciola, L., Ferrari, S., and Lilley, D.M.J. 1992. The DNA binding site of HMG1 protein is composed of two similar segments (HMG boxes), both of which have counterparts in other eukaryotic regulatory proteins. EMBO J. 11: 10551063.[Medline]
Bovin, A.M.J.J. and Brunger, A.T. 1995. Variablity of solution nuclear magnetic resonance structures. J. Mol. Biol. 250: 8093.[CrossRef][Medline]
. 1996. Do NOE distances contain enough information to assess the relative populations of multi-conformer structures? J. Biomol. NMR 7: 7276.[Medline]
Braunschweiler, L. and Ernst, R.R. 1983. Coherence transfer by isotropic mixing: Application to protein correlation spectroscopy. J. Magn. Reson. 53: 521528.
Broadhurst, R.W., Hardman, C.H., Thomas, J.O., and Laue, E.D. 1995. Backbone dynamics of the A-domain of HMG1 as studied by 15N NMR spectroscopy. Biochemistry 34: 1660816617.[CrossRef][Medline]
Brunger, A.T. 1993. XPLOR: A system for X-ray crystallography and NMR. Yale University, New Haven, CT.
Cary, P.D., Turner, C.H., Mayes, E., and Crane-Robinson, C. 1983. Conformation and domain structure of the non-histone chromosomal proteins, HMG 1 and 2. Isolation of two folded fragments from HMG 1 and 2. Eur. J. Biochem. 131: 367374.[Medline]
Clore, G.M., Driscoll, P.C., Wingfield, P.T., and Gronenborn, A.M. 1990a. Analysis of the backbone dynamics of interleukin-1ß using 2-dimensional inverse detected heteronuclear 15N1H NMR spectroscopy. Biochemistry 29: 73877401.[CrossRef][Medline]
Clore, G.M., Szabo, A., Bax, A., Kay, L.E., Driscoll, P.C., and Gronenborn, A.M. 1990b. Deviations from the simple 2-parameter model-free approach to the interpretation of 15N nuclear magnetic-relaxation of proteins. J. Amer. Chem. Soc. 112: 49894991.[CrossRef]
Connor, F., Cary, P.D., Read, C.M., Preston, N.S., Driscoll, P.C., Denny, P., Crane-Robinson, C., and Ashworth, A. 1994. DNA binding and bending properties of the post-meiotically expressed Sry-related protein Sox-5. Nucleic Acids Res. 22: 33393346.
Crane-Robinson, C., Read, C.M., Cary, P.D., Driscoll, P.C., Dragan, A.I., and Privalov, P.L. 1998. The energetics of HMG box interactions with DNA. Thermodynamic description of the box from mouse Sox-5. J. Mol. Biol. 281: 705717.[CrossRef][Medline]
Davis, D.G. and Bax, A. 1985. Assignment of complex 1H NMR spectra via two-dimensional homonuclear HartmannHahn spectroscopy. J. Amer. Chem. Soc. 107: 28202821.[CrossRef]
Denny, P., Swift, S., Connor, F., and Ashworth, A. 1992. An Sry-related gene expressed during spermatogenesis in the mouse encodes a sequence-specific DNA-binding protein. EMBO J. 11: 37053712.[Medline]
Driscoll, P.C., Clore, G.M., Marion, D., Wingfield, P.T., and Gronenborn, A.M. 1990. Complete resonance assignment for the polypeptide backbone of interleukin 1ß using three-dimensional heteronuclear NMR spectroscopy. Biochemistry 29: 35423556.[CrossRef][Medline]
Ferrari, S., Harley, V.R., Pontiggia, A., Goodfellow, P.N., Lovell-Badge, R., and Bianchi, M.E. 1992. SRY, like HMG1, recognizes sharp angles in DNA. EMBO J. 11: 44974506.[Medline]
Giese, K., Amsterdam, A,. and Grosschedl, R. 1991. DNA-binding properties of the HMG domain of the lymphoid-specific transcriptional regulator LEF-1. Genes & Dev. 5: 25672578.
Giese, K., Cox, J., and Grosschedl, R. 1992. The HMG domain of lymphoid enhancer factor 1 bends DNA and facilitates assembly of functional nucleoprotein structures. Cell 69: 185195.[CrossRef][Medline]
Gronenborn, A.M., Filpula, D.R., Essig, N.Z., Achari, A., Whitlow, M., Wingfield, P.T., and Clore, G.M. 1991. A novel, highly stable fold of the immunoglobulin binding domain of streptococcal protein-G. Science 253: 657661.
Groschedl, R., Giese, K., and Pagel, J. 1994. HMG domain proteinsarchitectural elements in the assembly of nucleoprotein structures. Trends Genet. 10: 94100.[CrossRef][Medline]
Hardman, C.H., Broadhurst, R.W., Raine, A.R.C., Grasser, K.D., Thomas, J.O., and Laue, E.D. 1995. Structure of the A-domain of HMG1 and its interaction with DNA as studied by heteronuclear three- and four-dimensional NMR spectroscopy. Biochemistry 34: 1659616607.[CrossRef][Medline]
Jantzen, H.M., Adam, A., Bell, S.P., and Tjian, R. 1990. Nucleolar transcription factor hUBF contains a DNA-binding motif with homology to HMG proteins. Nature 344: 830836.[CrossRef][Medline]
Jeener, J., Meier, B.H., Bachmann, P., and Ernst, R.R. 1979. Investigation of exchange processes by two-dimensional NMR spectroscopy. J. Phys. Chem. 71: 45464553.[CrossRef]
Johns, E.W. 1982. The HMG chromosomal proteins. Academic Press, New York, NY.
Johnson, B.A. and Blevins, RA. 1994. NMRView: A computer programme for the visualization and analysis of NMR data. J. Biomol. NMR 4: 603614.[CrossRef]
Jones, D.N.M., Searles, M.A., Shaw, G.L., Churchill, M.E.A., Ner, S.S., Keeler, J., Travers, A.A., and Neuhaus, D. 1994. The solution structure and dynamics of the DNA-binding domain of HMG-D from Drosophila melanogaster. Structure 2: 609627.
Kay, L.E., Torchia, D.A., and Bax, A. 1989. Backbone dynamics of proteins as studied by 15N inverse detected heteronuclear NMR-spectroscopyApplication to Staphylococcal nuclease. Biochemistry 28: 89728979.[CrossRef][Medline]
Kieffer, B., Driscoll, P.C., Campbell, I.D., Willis, A.C., van der Merwe, P.A., and Davis, S.J. 1994. 3-dimensional solution structure of the extracellular region of the complement regulatory protein CD59, a new cell-surface protein domain related to snake-venom neurotoxins. Biochemistry 15: 44714482.
Kristensen, S.M., Siegal, G., Sankar, A., and Driscoll, P.C. 2000. Backbone dynamics of the C-terminal SH2 domain of the p85
subunit of phosphoinositide 3-kinase: Effect of phosphotyrosine peptide binding and characterization of slow conformational exchange processes. J. Mol. Biol. 299: 771788.[CrossRef][Medline]
Kumar, A., Ernst, R.R., and Wüthrich, K. 1980. A two-dimensional nuclear overhauser enhancement (2D NOE) experiment for the elucidation of complete protonproton cross-relaxation networks in biological macromolecules. Biochem. Biophys. Res. Commun. 95: 16.[CrossRef][Medline]
Laskowski, R.A., Rullmann, J.A.C., MacArthur, M.W., Kaptein, R., and Thornton, J.M. 1996. AQUA and PROCHECK-NMR: Programs for checking the quality of protein structures solved by NMR. J. Biomol. NMR 8: 477486.[Medline]
Lefebvre, V., Li, P., and de Crombrugghe B. 1998. A new long form of Sox5 (L-Sox5), Sox6 and Sox9 are coexpressed in chondrogenesis and cooperatively activate the type II collagen gene. EMBO J. 17: 57185733.[CrossRef][Medline]
Lipari, G. and Szabo, A. 1982a. Model-free approach to the interpretation of nuclear magnetic resonance relaxation in macromolecules. 1. Theory and range of validity. J. Amer. Chem. Soc. 104: 45464559.[CrossRef]