Influence of length and flexibility of spacers on the binding affinity of divalent ligands

Susanne Liese; Roland R. Netz

doi:10.3762/bjoc.11.90

/ E-Alerts

Influence of length and flexibility of spacers on the binding affinity of divalent ligands

Fachbereich für Physik, Freie Universität Berlin, 14195 Berlin, Germany

Corresponding author email

This article is part of the Thematic Series "Multivalency as a chemical organization and action principle".

Guest Editor: R. Haag
Beilstein J. Org. Chem. 2015, 11, 804–816. https://doi.org/10.3762/bjoc.11.90
Received 03 Mar 2015, Accepted 29 Apr 2015, Published 15 May 2015

Full Research Paper

PDF

Album

Supp Info

Cite

Abstract

We present a quantitative model for the binding of divalent ligand–receptor systems. We study the influence of length and flexibility of the spacers on the overall binding affinity and derive general rules for the optimal ligand design. To this end, we first compare different polymeric models and determine the probability to simultaneously bind to two neighboring receptor binding pockets. In a second step the binding affinity of divalent ligands in terms of the IC₅₀ value is derived. We find that a divalent ligand has the potential to bind more efficiently than its monovalent counterpart only, if the monovalent dissociation constant is lower than a critical value. This critical monovalent dissociation constant depends on the ligand-spacer length and flexibility as well as on the size of the receptor. Regarding the optimal ligand-spacer length and flexibility, we find that the average spacer length should be equal or slightly smaller than the distance between the receptor binding pockets and that the end-to-end spacer length fluctuations should be in the same range as the size of a receptor binding pocket.

Keywords: binding affinity; divalent ligand; effective concentration; multivalency

Introduction

Multivalency is a common design principle in biological systems. The simultaneous binding of several, relatively weakly binding partners is a widely used strategy to strengthen the overall binding affinity [1-3]. Multivalency is believed to play an important role in evolutionary processes, since the collective interaction of several rather simple ligands makes the development of more complex binding partners with a higher binding affinity unnecessary [2]. Also in drug design, the synthesis of artificial multivalent ligands is a promising route to increase the binding affinity or to reduce the amount of substance required for treatment [4-7].

The term multivalency is used for systems that consist of several identical binding partners. Thereby, the larger binding partner, for example a protein, is commonly denoted as receptor, whereas the smaller binding partner, for instance an enzyme or a single molecule, is denoted as ligand. The binding strength of a multivalent structure significantly depends on details of the presentation of ligands and receptors [1]. Each multivalent ligand consists of several monovalent ligands that are connected via a scaffold. The binding affinity of such a multivalent ligand is determined by the interplay between gain in binding energy and loss of entropy associated with conformational degrees of freedom. The more flexible the scaffold is, the better it can adapt to the geometry of the receptor, but the more pronounced on the other hand is the entropy penalty. This simple, qualitative argument shows that the careful choice of the ligand scaffold is essential, in order to benefit from multivalent enhancement. It is therefore desirable to derive a model that allows one to predict the binding affinity of a given ligand-scaffold construct. Several theoretical studies have been dedicated to study the interaction between multi- and polyvalent ligands with receptors arranged on planar surfaces [8-13]. The overwhelming variety of multivalent ligand architectures that range from small divalent ligands to densely packed nanoparticles, led to different approaches to describe multivalency, depending on the size and valency of the system. Several studies aimed to treat ligand–receptor systems with different structures and valencies in the framework of a generalized theory [14,15].

The smallest multivalent system consists of a divalent ligand that interacts with a divalent receptor. Despite its seeming simplicity, the rational design of divalent ligands is still challenging [16-19]. In this paper we examine a general model for a divalent receptor–ligand system in order to estimate the binding affinity from the dissociation constant of the monovalent ligand and the length and flexibility of the ligand spacer.

Figure 1a schematically depicts a divalent ligand–receptor system. The receptor possesses two binding pockets at a distance d from each other. A binding range of σ characterizes each binding pocket. The divalent ligand consists of two ligand units that are connected via a spacer. The contour length of the spacer is denoted as L. There are three different modes in which a divalent ligand can bind to a divalent receptor. Each of these binding modes has a different number of realization possibilities as summarized in Figure 1b: (1) One binding pocket is occupied by one ligand. (2) Two binding pockets are occupied by two ligands. (3) Two binding pockets are occupied by one ligand. The binding affinity in the latter case is strongly influenced by the conformational linker properties, which can be conveniently discussed in terms of the effective concentration. The effective concentration describes the local concentration of one ligand unit close to one binding pocket, if the other ligand unit is assumed to be bound to the other binding pocket. The effective concentration thus corresponds to the probability that the spacer extends to an end-to-end distance that is equal to d, if spacer–receptor interactions are neglected [20]. In the first section different models for the effective concentration are discussed, with particular focus on the influence of the spacer stiffness and the binding range σ.

[1860-5397-11-90-1] — **Figure 1:** (a) Schematic of a divalent ligand–receptor system: The receptor has two binding pockets with a distance d from each other and a binding range σ. The ligand consists of two identical ligand units, connected via a spacer of contour length L. The end-to-end distance of the ligand is denoted as r. (b) Binding modes of a divalent ligand: (1) One ligand occupies one binding pocket. (2) Two ligands occupy two binding pockets. (3) One ligand occupies both binding pockets.

**Figure 1:** (a) Schematic of a divalent ligand–receptor system: The receptor has two binding pockets with a dis...

Jump to Figure 1

For each binding mode depicted in Figure 1b the following dissociation constants are derived: (1) The dissociation constant is equal to the dissociation constant of the monovalent ligand, K_mono, multiplied by a factor of 1/α, which accounts for the reduced degrees of freedom of the spacer, since it cannot penetrate the receptor. The parameter α can adopt value between 0 and 1. In the limiting case, in which the spacer sterically inhibits the ligand unit from binding to the receptor, α becomes 0. In the hypothetical case, in which the conformational degrees of freedom of the spacer do not reduce at all when binding to a receptor, the parameter α becomes 1. (2) Each ligand contributes with a factor of K_mono/α to the dissociation constant. (3) The dissociation constant consists of the monovalent dissociation constant for each ligand times the probability that the spacer bridges the two binding pockets. A detailed derivation of the dissociation constants is presented in Supporting Information File 1. Furthermore, Figure 1b summarizes the combinatorial factors for each binding mode that count the number of equivalent permutations. We regard the divalent ligands as distinguishable, we note in passing that this could reflect polymeric spacers that exhibit chemical asymmetry. Our final results do not depend on whether we assume indistinguishable ligand units or not.

Results and Discussion

Effective concentration – wormlike-chain model

Samuel and Sinha [21] developed an exact method to describe the conformational statistics of wormlike chains for the whole range from short to long polymers. Their model is applied here to determine the effective concentration C_eff, which is equivalent to the end-to-end distance probability distribution, with the normalization [Graphic 1] . An example is shown in Figure 2. The length of the fully extended spacer L is set to 5 nm. The effective concentration, i.e., the probability that a spacer of given length and stiffness extends to a certain end-to-end-distance d, is shown for different persistence lengths l_p. The flexible spacer (l_p = 1 nm) exhibits a maximum at d = 0. Furthermore, the distribution is very broad, indicating that a flexible spacer can easily bridge two binding pockets, even if the spacer length does not exactly match the inter binding pocket distance d. For a slightly stiffer spacer (l_p = 1.3 nm), C_eff is even broader, but the maximum of C_eff is reduced by a factor of about one half and the distribution shows a plateau between d = 0 nm and d = 3 nm. For stiff spacers (l_p = 5 nm and l_p = 10 nm), C_eff exhibits a narrow peak close to the fully extended state. In the bound state, the ligand units explore the range σ of a receptor binding pocket. Hence, it is useful to consider the effective concentration averaged over the range of both binding pockets. We denote the averaged effective concentration as [Graphic 2] with

with V_bp the volume of one binding pocket, r₁ and r₂ the positions within the first and second binding pocket. We introduce the connecting vector r = |r₁ − r₂| and express r in spherical coordinates:

with r the distance between the two ligand units, θ the angle between r and the connecting vector of the binding pocket midpoints and φ an angle that describes the rotation around the connecting vector of the binding pocket midpoints. Since the range of the binding pocket σ is assumed to be much smaller than the distance between the binding pockets d, we conclude that the integrals in Equation 2 approximately factorize. Furthermore, the size of the binding pocket limits the range over which the angle θ can vary. In the range, where r varies between d − σ and d + σ, the angle θ can adopt a maximum value of arctan(σ/r). The upper limit for the integration over θ then reads

[Graphic 3]

The integration over r can now be described by variations of r in the range from d − σ and d + σ. With these approximations, Equation 2 can be written as an effective average over one dimension:

In Figure 2, the averaged effective concentration is shown as green, dashed lines, with σ = 0.25 nm. A flexible spacer can easily extend to all positions within the binding pockets. Hence, one cannot observe any significant difference between [Graphic 4] and C_eff. In contrast, a very stiff spacer cannot explore the whole binding pocket. Therefore, the averaged effective concentration is reduced and slightly broadened around its maximum, as can be seen best in Figure 2 for l_p = 10 nm.

[1860-5397-11-90-2] — **Figure 2:** Effective concentration Ceff of spacers with a contour length of L = 5 nm as a function of the distance between the binding pockets. The effective concentration is shown for different spacer stiffness, in terms of different persistence lengths between l_p = 1–10 nm (continuous lines). The effective concentration , averaged over a binding pocket range σ = 0.25 nm, is shown as green, dashed lines.

**Figure 2:** Effective concentration Ceff of spacers with a contour length of L = 5 nm as a function of the dist...

Jump to Figure 2

[Graphic 5] — **Figure 2:** Effective concentration Ceff of spacers with a contour length of L = 5 nm as a function of the distance between the binding pockets. The effective concentration is shown for different spacer stiffness, in terms of different persistence lengths between l_p = 1–10 nm (continuous lines). The effective concentration , averaged over a binding pocket range σ = 0.25 nm, is shown as green, dashed lines.

**Figure 2:** Effective concentration Ceff of spacers with a contour length of L = 5 nm as a function of the dist...

Jump to Figure 2

Figure 3 summarizes the averaged end-to-end distance r_ete, the end-to-end distance that corresponds to a maximum in C_eff, r_max, the variance of the end-to-end distance distribution Δr, the maximum of the effective concentration [Graphic 6] and the effective concentration at r_ete, C_eff(r_ete), for different persistence lengths. The influence of the binding range σ is neglected here. The average end-to-end distance r_ete increases monotonically with increasing persistence length and approaches the contour length L for very stiff spacers. All other quantities reveal a clear-cut difference between the flexible and stiff limits. The classification “flexible” and “stiff” is, of course, to some degree arbitrary. We here apply a definition that is based on the discontinuity in r_max, which is the most prominent feature in the chain observables. In the following, spacers with a persistence length smaller than 0.26L are called flexible and spacers with a persistence length larger than 0.26L are called stiff. The variance Δr exhibits a maximum around l_p = 0.26L, for stiffer spacers Δr reduces rapidly. As can be seen in Figure 3, the variance Δr depends on the persistence length as Δr = 0.1L²/l_p (dotted line) for stiff spacers. Mac Kintosh et al. found the same scaling for the fluctuations of semiflexible polymers [22]. The maximum of the effective concentration [Graphic 7] (continuous line) as well as the effective concentration at r_ete, C_eff(r_ete), (dashed line) are minimal in the same region where Δr is maximal. Since for a stiff spacer r_max and r_ete are both close to L, [Graphic 8] and C_eff(r_ete) exhibit only small deviations from each other. For flexible spacers on the other hand, C_eff(r_ete) can be much smaller than the maximal effective concentration. The results presented here show that neither the persistence length nor the contour length alone are sufficient to describe the behavior of the effective concentration, rather the ratio between persistence length and contour length, l_p/L, characterizes the conformational behavior. Note that for a typical receptor distance of d = 5 nm, DNA molecules with l_p = 53 nm are characterized by a ratio l_p/L ≈ 10 and thus correspond to the very stiff limit. Polyethylene glycol (PEG) with a persistence length of about l_p = 0.38 nm on the other hand is characterized by a ratio smaller than l_p/L = 0.08 and thus correspond to the flexible limit [23].

[1860-5397-11-90-3] — **Figure 3:** Average end-to-end distance, r_ete, end-to-end-distance where the effective concentration C_eff exhibits a maximum, r_max, variance of the end-to-end distance distribution, Δr, maximum of the effective concentration, (continuous line), and effective concentration at r_ete, C_eff(r_ete) (dashed line), in dependence of the persistence length l_p. All lengths are measured in units of the spacer contour length L. Spacers with a persistence length l_p < 0.26L are called flexible. Spacers with a persistence length l_p > 0.26L are called stiff. For stiff spacers the relation between Δr/L and the persistence length is well described by Δr/L = 0.1L/l_p (dotted line).

**Figure 3:** Average end-to-end distance, r_ete, end-to-end-distance where the effective concentration C_eff exhib...

Jump to Figure 3

[Graphic 9] — **Figure 3:** Average end-to-end distance, r_ete, end-to-end-distance where the effective concentration C_eff exhibits a maximum, r_max, variance of the end-to-end distance distribution, Δr, maximum of the effective concentration, (continuous line), and effective concentration at r_ete, C_eff(r_ete) (dashed line), in dependence of the persistence length l_p. All lengths are measured in units of the spacer contour length L. Spacers with a persistence length l_p < 0.26L are called flexible. Spacers with a persistence length l_p > 0.26L are called stiff. For stiff spacers the relation between Δr/L and the persistence length is well described by Δr/L = 0.1L/l_p (dotted line).

**Figure 3:** Average end-to-end distance, r_ete, end-to-end-distance where the effective concentration C_eff exhib...

Jump to Figure 3

Effective concentration – harmonic spring and Gaussian chain approximation

The wormlike-chain model requires complex numerical analysis for the calculation of conformational chain properties. In a simplified model the spacer statistics can be described as a harmonic spring or a Gaussian chain with suitably chosen parameters. The advantage of this model is that the effective concentration can be derived in closed form. Furthermore, we show that despite its simplified assumptions the model accurately reproduces the effective concentration C_eff(r_ete) for flexible as well as for stiff spacers.

Stiff spacer – harmonic spring approximation

A stiff spacer is on average extended to almost its full length. The fluctuations around its most probable end-to-end distance r₀ are assumed to be much smaller than the contour length L. We approximate the free energy F, similar to a harmonic spring, as

with k the effective spring constant and d the end-to-end distance. The effective concentration C_eff(d), i.e., the normalized probability to extend the spacer to a certain end-to-end distance d, reads

The averaged effective concentration [Graphic 10] as defined in Equation 3 then becomes:

In order to express the effective concentration in term of the experimentally more relevant average end-to-end distance r_ete and the variance Δr, we first have to determine the relation between r_ete and Δr on the one side and k and r₀ on the other side.

From the free energy F in Equation 4 the average end-to-end distance r_ete and the variance Δr are obtained as:

Note that according to our notation, the average end-to-end distance r_ete is not equivalent to the root mean squared end-to-end distance [Graphic 11] . The variance Δr hence reads:

Using Equation 6 and the results for Δr and r_ete in terms of the model parameters k and r₀ in the stiff spacer limit [Graphic 12] , the averaged effective concentration reads:

For a fixed distance d that has to be spanned by the ligand, the effective concentration becomes maximal for r_ete = d and we obtain, for this optimized spacer length, the result:

Furthermore, we can differentiate between two cases: 1) the chain fluctuations are smaller than the binding range (Δr << σ) and 2) the chain fluctuations are larger than the binding range (Δr >> σ), leading to

We see that in both limits, the maximal effective concentration decreases quadratically with the distance d. More importantly, increasing the stiffness of the spacer (decreasing Δr) increases the effective concentration, but only until the variance Δr becomes of the same order as the binding range σ. For even stiffer spacers the effective concentration stagnates, as can be seen in Equation 15. We conclude that it is not advantageous to increase the spacer stiffness beyond the situation where the end-to-end distance variance Δr becomes smaller than the receptor binding range σ. To compare this model with the wormlike-chain model Equation 16 is rewritten as:

As can be seen in Figure 4a Equation 17 describes the behavior of stiff wormlike chains very well.

[1860-5397-11-90-4] — **Figure 4:** Effective concentration for the optimized average end-to-end distance r_ete=d for the wormlike chain model (continuous line) and the harmonic spring model Equation 17 (dotted line, subfigure a) as well as the Gaussian-chain model Equation 25 (dotted line, subfigure b). In the calculation, we vary the ratio between persistence length and contour length l_p/L, which results in different ratios Δr/d and d/L, respectively. (a) Stiff spacers are well approximated by Equation 17. (b) Flexible spacers are well approximated by Equation 25.

**Figure 4:** Effective concentration for the optimized average end-to-end distance r_ete=d for the wormlike chain...

Jump to Figure 4

Flexible spacer – Gaussian-chain approximation

The effective concentration of flexible polymers is often modeled by a Gaussian chain [11,20,24] with the free energy:

using the mean squared end-to-end distance [Graphic 13] . The end-to-end distance r_ete and the variance Δr can be expressed in terms of the mean squared end-to-end distance:

As a consequence the end-to-end distance r_ete and the variance Δr are related as

Furthermore, the mean squared end-to-end distance can be written as

with b being the Kuhn length of one chain segment and N the number of segments.

We here present the effective concentration as a function of d and r_ete.

Using Equations 19–22, r_ete can as well be substituted by [Graphic 14] , Δr or N.

Note that the effective concentration of a flexible spacer with fixed contour length L is maximal at a distance d = 0, as shown in Figure 2. In contrast, for a given distance d the effective concentration becomes maximal at [Graphic 15] . In other words, the average end-to-end distance of an optimized flexible spacer is smaller than the distance between the binding pockets by a factor of [Graphic 16] :

Since we consider the fluctuations of a flexible chain much larger than the range of the binding pocket, we neglect the influence of σ on the effective concentration. In order to compare the behavior of a Gaussian chain with the results for a flexible wormlike chain, Equation 24 is rewritten as:

In Figure 4b, Equation 25 is shown together with the numerical results from the wormlike chain model obtained in the previous section. The two models show good agreement in the flexible limit, as expected.

Conformational degrees of freedom of a tethered spacer

If one ligand unit is bound to one of the binding pockets, the conformational degrees of freedom of the spacer are reduced, since it cannot penetrate the receptor surface. We quantify this reduction by the parameter α, which describes the ratio between the partition function of a tethered and a free spacer. The value of α depends on the shape of the receptor and the flexibility of the spacer. To estimate the typical magnitude of α we consider as limiting cases a stiff rod as well as a flexible Gaussian chain tethered to a planar surface.

Stiff spacer

For a stiff rod attached with one end to a planar surface, the parameter α becomes α = 1/2, since the rod can only explore one half space.

Flexible spacer

As a second example we discuss a Gaussian chain. Equivalently to Equation 23 the normalized probability that a Gaussian chain consisting of N segments extends to an end-to-end distance r with b being the length of one segment reads in free space:

We now assume that one end of the chain is attached to the surface. Similar to the considerations made for a stiff rod, we approximate the probability that the first segment does not penetrate the surface by a factor 1/2. The probability distribution for the remaining N − 1 segments then reads:

with ρ the component of the end-to-end vector parallel to the surface and z the height above the surface. The last term in Equation 27 ensures that the chain does not penetrate the surface (P′(ρ,z = 0,N) = 0). To obtain the parameter α, P′ has to be integrated over one half space:

In the limit of a long chain (N >> 1), Equation 28 can be approximated as:

A PEG spacer with b = 0.38 nm requires 30–800 segments to adopt an average end-to-end distance of 2 to 10 nm. In this range α varies between 0.02 and 0.13.

Binding affinity

With the effective concentration and a parameterization for the reduction of the conformational degrees of freedom of the spacer at hand, we now can examine the binding affinity of a divalent ligand. A common way to quantify the binding affinity of a multivalent ligand is the so-called IC₅₀ value, the ligand (or inhibitor) concentration at half maximal inhibition. In a first step we want to re-derive the relation between the IC₅₀ value and the dissociation constant of a monovalent ligand [25,26].

Monovalent ligand

In the reaction [Graphic 17] , the dissociation constant K_mono of a monovalent ligand interacting with a monovalent receptor is defined as

with [L] and [R] being the concentration of unbound ligands and unbound receptors and [RL] the concentration of bound ligands or equivalently the concentration of bound receptors.

If half of all receptors are occupied, which defines the IC₅₀ condition, the other half must be unbound and as a consequence [R] = [RL]. From Equation 30 we see that under IC₅₀ conditions the dissociation constant equals the concentration of unbound ligands:

with the index 50 indicating that the IC₅₀ condition is fulfilled. In the monovalent case exactly one ligand binds to one receptor. Thus, the concentration of bound ligands under IC₅₀ conditions is given by half the total receptor concentration:

with [R]₀ = [R] + [RL] the total receptor concentration. Combining Equation 31 and Equation 32 the IC₅₀ value is obtained as [25]:

In the limit of dilute receptor conditions ([R]₀ << K_mono) the IC₅₀ value is a good approximation for the dissociation constant, and we find:

Divalent ligand

In analogy to the monovalent case, we now derive an expression for the IC₅₀ value of a divalent ligand. There are different ways of defining half maximal inhibition for divalent receptors. We first adopt a heuristic definition where half of all receptor binding pockets are occupied by a ligand unit. This definition is most relevant for competitive binding assays, for instance surface plasmon resonance measurements [27], since the measured signal in a competitive binding assay is related to the number of occupied binding pockets. Later, we also define a situation in which at least one ligand unit is bound to half of all receptors as IC₅₀ condition, which mimics non-competitive binding assays, as for instance hemagglutination assays [28]. In non-competitive binding assays the number of bound ligands rather than the number of occupied binding pockets is measured. In general the concentration of occupied binding pockets [bp]_occ of divalent receptors reads:

with [RL_n] being the concentration of bound ligand–receptor pairs, with n referring to the three binding modes summarized in Figure 1b. Each term on the right hand side of Equation 35 has two prefactors. The first prefactor counts the number of occupied binding pockets per receptor and the second prefactor counts the permutations due to the distinguishability of the ligand units and the receptor binding pockets (see Figure 1b). Note that the number of permutations presented in Figure 1b and Equation 35, are obtained for distinguishable ligand units. For indistinguishable ligand units the number of permutations in each binding mode is reduced. At the same time the dissociation constant of a ligand with indistinguishable ligand units is reduced by the same factor. Hence, the overall concentration of bound ligands does not change. A detailed derivation of the dissociation constants for each binding mode is presented in Supporting Information File 1.

In the same way the total concentration of binding pockets, [bp]₀, can be obtained as

In order to discuss also the IC₅₀ condition for non-competitive binding assays we derive the concentration of receptors with at least one binding pocket occupied, [R]_1bp, and the total receptor concentration, [R]_0, as

With Equations 35–38 the IC₅₀ condition for competitive and non-competitive binding is expressed as given in Equation 39 and Equation 40.

In analogy to the monovalent case we define the multivalent dissociation constant K_multi as the concentration of free ligand under IC₅₀ conditions, as defined in Equation 39 and Equation 40.

Equation 41 and Equation 42 show the multivalent dissociation constant K_multi in case of competitive binding and non-competitive binding, respectively.

Competitive and non-competitive binding exhibit the same qualitative behavior for large effective concentrations. We therefore limit the further discussion to competitive binding, as given in Equation 41.

As one would intuitively expect, the multivalent dissociation constant K_multi becomes proportional to the monovalent dissociation constant, if the effective concentration is low, i.e., if [Graphic 18] . In contrast, the multivalent dissociation constant decreases, if the dissociation constant of the monovalent ligand is small and if the effective concentration, i.e., the probability to connect two binding pockets, is large.

To determine the total ligand concentration we first have to derive the concentration of bound ligand [L]_bound as shown in Equation 43.

Using Equation 38 and 43, a relation between the concentration of bound ligands and the total receptor concentration under IC₅₀ conditions is obtained as

where we note that that ψ is a coefficient that varies between 1 and 5/4. Similar to the results for monovalent receptor–ligand systems in Equation 34, the IC₅₀ value becomes equivalent to the multivalent dissociation constant, in the limit of low receptor concentrations, i.e., for [R]₀ << K_multi:

To compare monovalent and multivalent ligands we use the relative binding affinity (RBA), which we define as

Here, the factor 2 accounts for the valency of the ligand and ensures that the concentration of ligand units are compared. The larger the RBA the better is the divalent ligand. For RBA = 1 the same concentration of mono- and divalent ligand units, taking into account that a divalent ligand consist of two ligand units, is required to occupy half of the receptor binding pockets. For RBA < 1 the monovalent ligand binds better than the divalent ligand. In this case the loss in entropy of the spacer is larger than the gain in binding energy due to the multiple binding of ligand units. Inserting the effective concentration from Equation 13 and Equation 23 into Equation 41 and Equation 47, the RBA can be calculated for any given divalent ligand–receptor pair. As an example the RBA is depicted for different spacers and different values of K_mono in Figure 5. We here assume that the receptor is well described by a large, planar surface. Hence, the parameter α is approximated by 1/2 for stiff spacer and by Equation 29 for flexible spacers. In all cases we consider a divalent receptor with a distance d = 5 nm between the binding pockets. Each binding pocket has a binding range σ = 0.1 nm. In all three subfigures we see that if K_mono is too large, i.e., if the monovalent binder is too weak, the RBA-value never reaches 1. In such a situation, using the RBA-value as a quantifier, the monovalent ligand binds always better than the divalent ligand. Furthermore, at a certain K_mono, which we will further on denote as [Graphic 19] , there is exactly one spacer length, parameterized by r_ete, for which monovalent and divalent ligands bind equally well. If K_mono is lower than [Graphic 20] , there is a broader range of spacer lengths for which the divalent ligand binds better than the monovalent ligand (RBA > 1). In Figure 5a the behavior of a stiff spacer with persistence length l_p = 53 nm is depicted, which mimics a DNA spacer to which the ligand units are directly attached. A DNA spacer with a contour length of 5 nm exhibits fluctuations in the range Δr ≈ 0.05 nm, which is considerably smaller than the binding range σ. As is discussed in the previous section, the maximum and width of the effective concentration and therefore also the maximum and width of the RBA are in this case determined by the binding range σ. In Figure 5b we assume a DNA spacer that is decorated with flexible PEG linkers at both ends. The PEG linkers consist of four monomers each. Assuming Gaussian-chain behavior with a segment length of b = 0.38 nm [29], the fluctuations of the PEG linkers and hence the fluctuations of the whole ligand sum up to Δr = 0.5 nm. The shape of the RBA now is much broader, showing that the ligand is less affected by a mismatch between spacer length and distance between the binding pockets. Additionally, we obtain [Graphic 21] = 5 mM in Figure 5b which is considerably smaller than [Graphic 22] = 28 mM for the pure DNA spacer in Figure 5a. The same trend is continued in Figure 5c. The more flexible the spacer, the smaller is [Graphic 23] , indicating that flexible spacers are less suitable to improve the binding affinity of weak monovalent binders, even though they are more tolerant with respect to a mismatch between linker length and receptor distance.

[1860-5397-11-90-5] — **Figure 5:** Relative binding affinity (*RBA*) of a divalent ligand in dependence of the end-to-end distance of the spacer r_ete from Equation 47. The three different ligand–spacer constructs are schematically depicted in the insets. The binding pockets are separated by d = 5 nm. Each binding pocket has a binding range of σ = 0.1 nm. (a) The ligand units are directly attached to a stiff DNA spacer, characterized by a persistence length l_p = 53 nm. (b) The ligand units are attached to a stiff DNA spacer with flexible linker chain, leading to an end-to-end distance fluctuation of Δr = 0.5 nm. (c) The ligand units are connected via a flexible spacer.

**Figure 5:** Relative binding affinity (*RBA*) of a divalent ligand in dependence of the end-to-end distance of th...

Jump to Figure 5

To investigate the transition from RBA < 1 to RBA > 1 further, we determine the critical dissociation constant [Graphic 24] for which the RBA is equal to one for the optimized chain length, i.e., for the chain length that maximizes the RBA value. Using Equation 41 and Equation 47 it can easily be seen that [Graphic 25] relates to the effective concentration [Graphic 26] as

In Figure 6, [Graphic 27] is shown for stiff as well as flexible ligands. The stiff ligand is considered to consist of a DNA spacer to which the ligand units are attached via two PEG linkers. Linker length and binding range are set to be identical to the example presented in Figure 5b. The average end-to-end distance of the DNA spacer is either chosen to be equal to d (black, continuous line), or is chosen to be too short by 0.7 nm, which mimics the length of two base pairs (red, continuous line). Even though the mismatch between spacer length and binding pocket distance is small, the ligand becomes significantly less efficient.

The flexible ligand is chosen to resemble a PEG spacer. Again, we assume Gaussian-chain behavior with a segment length of b = 0.38 nm. A ligand with optimized spacer length (black, dashed line) does not exhibit a significant difference to a ligand with a spacer that is shortened by two segments (red, dashed line). This shows again that a flexible chain is more tolerant with respect to a distance mismatch between inter-binding pocket distance d and chain length.

If the monovalent dissociation constant is larger than [Graphic 28] , a monovalent ligand always binds better than a divalent ligand. On the other hand, if the monovalent dissociation constant is smaller than [Graphic 29] , a divalent ligand of optimally (or slightly suboptimal) chosen size binds better than a monovalent ligand.

As can be seen in Figure 6, [Graphic 30] depends on the distance between the binding pockets as well as the spacer length and flexibility. In order to approximate an upper limit for [Graphic 31] , the maximum effective concentration (Equation 24 for a flexible spacer and Equation 15 and Equation 16 for a stiff spacer) is substituted into Equation 48:

As an example that is relevant for medical applications we want to briefly discuss the interaction between hemagglutinin (HA), a receptor protein on the surface of influenza viruses, and its ligand sialic acid (SA). The dissociation constant between monomeric SA and trimeric HA is known to be 2.5 mM [1]. Furthermore, the crystal structure of HA [30] indicates a distance between neighboring binding pockets in the range of d = 5 nm. Note that HA is a trivalent receptor, which means that additional binding modes as well as different numbers of permutations (see Figure 1b) have to be considered. Nevertheless, since the efficiency of a divalent ligand is mainly influenced by the effective concentration [Graphic 32] and the monovalent dissociation constant K_mono, rather than by the number of binding modes, we can compare the values for the SA–HA pair with the results presented in Figure 6. We see that a divalent ligand consisting of two SA units connected via a PEG spacer is expected to bind less efficient than the monovalent SA. In contrast, a stiff DNA spacer can increase the binding affinity of the divalent ligand compared to the monovalent ligand, if its length is optimized.

[1860-5397-11-90-6] — **Figure 6:** Efficiency diagram: is shown for different ligand–spacer constructs. If the monovalent dissociation constant is larger than , a monovalent ligand always binds better than a divalent ligand. If, on the other hand, the monovalent dissociation constant is smaller than , a divalent ligand of suitably chosen length binds better than its monovalent counterpart. We present in dependence of the distance between the binding pockets for a DNA spacer with flexible PEG linkers (Δr = 0.5 nm). In the optimal case, the spacer length is chosen equal to the distance d (black, continuous line). In the slightly suboptimal case, the spacer length is chosen to be 0.7 nm (two base pairs) shorter than the distance d (red, continuous line). In both cases the binding range is set to σ = 0.1 nm. We also show for a flexible PEG spacer with optimized spacer length (black, dashed line) and a spacer that is two monomers shorter (≈0.76 nm) (red, dashed line). The monovalent dissociation constant as well as the distance between neighboring binding pockets for a SA–HA pair is indicated by a black point.

**Figure 6:** Efficiency diagram: is shown for different ligand–spacer constructs. If the monovalent dissociatio...

Jump to Figure 6

[Graphic 33] — **Figure 6:** Efficiency diagram: is shown for different ligand–spacer constructs. If the monovalent dissociation constant is larger than , a monovalent ligand always binds better than a divalent ligand. If, on the other hand, the monovalent dissociation constant is smaller than , a divalent ligand of suitably chosen length binds better than its monovalent counterpart. We present in dependence of the distance between the binding pockets for a DNA spacer with flexible PEG linkers (Δr = 0.5 nm). In the optimal case, the spacer length is chosen equal to the distance d (black, continuous line). In the slightly suboptimal case, the spacer length is chosen to be 0.7 nm (two base pairs) shorter than the distance d (red, continuous line). In both cases the binding range is set to σ = 0.1 nm. We also show for a flexible PEG spacer with optimized spacer length (black, dashed line) and a spacer that is two monomers shorter (≈0.76 nm) (red, dashed line). The monovalent dissociation constant as well as the distance between neighboring binding pockets for a SA–HA pair is indicated by a black point.

**Figure 6:** Efficiency diagram: is shown for different ligand–spacer constructs. If the monovalent dissociatio...

Jump to Figure 6

Conclusion

In the present work we first examine different polymeric models for the effective concentration. We find that a wormlike-chain model can be well reproduced by a simple harmonic spring model and a Gaussian-chain model with suitable chosen parameters, in the stiff and flexible limits, respectively. We next study the binding between divalent ligand–receptor pairs. We find that multivalency increases the overall binding affinity only, if the monovalent ligand–receptor pair binds strongly enough, i.e.; if the monovalent dissociation constant is smaller than a critical value [Graphic 39] . Approximations for [Graphic 40] for both flexible and stiff ligands are derived in dependence of the distance between the binding pockets and the spacer length and flexibility. For the optimal ligand design, we find that for stiff ligands the average end-to-end distance should be equal to the distance between the binding pockets and the average fluctuations should be of the order, but not smaller, than the binding range. The average end-to-end distance of a flexible ligand on the other side should be smaller by a factor of [Graphic 41] than the binding pocket distance d.

Supporting Information

Supporting Information File 1: Detailed derivation of the dissociation constants for three different binding modes of a divalent ligand.
Format: PDF	Size: 257.6 KB	Download

Acknowledgements

This contribution was generously supported by the Deutsche Forschungsgemeinschaft DFG via grant SFB 765.

References

Mammen, M.; Choi, S.-K.; Whitesides, G. M. Angew. Chem., Int. Ed. 1998, 37, 2754–2794. doi:10.1002/(SICI)1521-3773(19981102)37:20<2754::AID-ANIE2754>3.0.CO;2-3
Return to citation in text: [1] [2] [3]
Kiessling, L. L.; Young, T.; Mortell, K. H. Multivalency in Protein–Carbohydrate Recognition. In Glycoscience – Chemistry and Chemical Biology; Fraser-Reid, B. O.; Tatsuka, K.; Thiem, J., Eds.; Springer: Berlin, Germany, 2001; pp 1817–1861. doi:10.1007/978-3-642-56874-9_42
Return to citation in text: [1] [2]
Pieters, R. J. Org. Biomol. Chem. 2009, 7, 2013–2025. doi:10.1039/b901828j
Return to citation in text: [1]
Disney, M. D.; Zheng, J.; Swager, T. M.; Seeberger, P. H. J. Am. Chem. Soc. 2004, 126, 13343–13346. doi:10.1021/ja047936i
Return to citation in text: [1]
Wang, J.; Tian, S.; Petros, R. A.; Napier, M. E.; DeSimone, J. M. J. Am. Chem. Soc. 2010, 132, 11306–11313. doi:10.1021/ja1043177
Return to citation in text: [1]
Schaschke, N.; Matschiner, G.; Zettl, F.; Marquardt, U.; Bergner, A.; Bode, W.; Sommerhoff, C. P.; Moroder, L. Chem. Biol. 2001, 8, 313–327. doi:10.1016/S1074-5521(01)00011-4
Return to citation in text: [1]
Vance, D.; Shah, M.; Joshi, A.; Kane, R. S. Biotechnol. Bioeng. 2008, 101, 429–434. doi:10.1002/bit.22056
Return to citation in text: [1]
Martinez-Veracoechea, F. J.; Frenkel, D. Proc. Natl. Acad. Sci. U. S. A. 2011, 108, 10963–10968. doi:10.1073/pnas.1105351108
Return to citation in text: [1]
Hu, J.; Lipowsky, R.; Weikl, T. R. Proc. Natl. Acad. Sci. U. S. A. 2013, 110, 15283–15288. doi:10.1073/pnas.1305766110
Return to citation in text: [1]
Wang, S.; Dormidontova, E. E. Phys. Rev. Lett. 2012, 109, 238102. doi:10.1103/PhysRevLett.109.238102
Return to citation in text: [1]
Diestler, D. J.; Knapp, E. W. J. Phys. Chem. A 2010, 114, 5287–5304. doi:10.1021/jp100077n
Return to citation in text: [1] [2]
Weber, M.; Bujotzek, A.; Haag, R. J. Chem. Phys. 2012, 137, 054111. doi:10.1063/1.4739501
Return to citation in text: [1]
Huskens, J.; Mulder, A.; Auletta, T.; Nijhius, C. A.; Ludden, M. J. W.; Reinhoudt, D. N. J. Am. Chem. Soc. 2004, 126, 6784–6797. doi:10.1021/ja049085k
Return to citation in text: [1]
Varilly, P.; Angioletti-Uberti, S.; Mognetti, B. M.; Frenkel, D. J. Chem. Phys. 2012, 137, 094108. doi:10.1063/1.4748100
Return to citation in text: [1]
Angioletti-Uberti, S.; Varilly, P.; Mognetti, B. M.; Tkachenko, A. V.; Frenkel, D. J. Chem. Phys. 2013, 138, 021102. doi:10.1063/1.4775806
Return to citation in text: [1]
Pertici, F.; de Mol, N. J.; Kemmink, J. M.; Pieters, R. J. Chem. – Eur. J. 2013, 19, 16923–16927. doi:10.1002/chem.201303463
Return to citation in text: [1]
Mack, E. T.; Snyder, P. W.; Perez-Castillejos, R.; Bilgiçer, B.; Moustakes, D. T.; Butte, M. J.; Whitesides, G. M. J. Am. Chem. Soc. 2012, 134, 333–345. doi:10.1021/ja2073033
Return to citation in text: [1]
Shan, M.; Bujotzek, A.; Abendroth, F.; Wellner, A.; Gust, R.; Seitz, O.; Weber, M.; Haag, R. ChemBioChem 2011, 12, 2587–2598. doi:10.1002/cbic.201100529
Return to citation in text: [1]
Scheibe, C.; Bujotzek, A.; Dernedde, J.; Weber, M.; Seitz, O. Chem. Sci. 2011, 2, 770–775. doi:10.1039/c0sc00565g
Return to citation in text: [1]
Krishnamurthy, V. M.; Estroff, L. A.; Whitesides, G. M. Multivalency in Ligand Design. In Fragment-based Approaches in Drug Design; Jahnke, W.; Erlanson, D. A., Eds.; Wiley-VCH: Weinheim, Germany, 2006; pp 11–53. doi:10.1002/3527608761.ch2
Return to citation in text: [1] [2]
Samuel, J.; Sinha, S. Phys. Rev. E 2002, 66, 050801. doi:10.1103/PhysRevE.66.050801
Return to citation in text: [1]
MacKintosh, F. C.; Käs, J.; Janmey, P. A. Phys. Rev. Lett. 1995, 75, 4425–4428. doi:10.1103/PhysRevLett.75.4425
Return to citation in text: [1]
Kienberger, F.; Pastushenko, V. P.; Kada, G.; Gruber, H. J.; Riener, C.; Schindler, H.; Hinterdorfer, P. Single Mol. 2000, 1, 123–128. doi:10.1002/1438-5171(200006)1:2<123::AID-SIMO123>3.0.CO;2-3
Return to citation in text: [1]
Gargano, J. M.; Ngo, T.; Kim, J. Y.; Acheson, D. W. K.; Lees, W. J. J. Am. Chem. Soc. 2001, 123, 12909–12910. doi:10.1021/ja016305a
Return to citation in text: [1]
Shoichet, B. K. J. Med. Chem. 2006, 49, 7274. doi:10.1021/jm061103g
Return to citation in text: [1] [2]
Hulme, E. C. Receptor–Ligand Interactions A Practical Approach; Oxford University Press: Oxford, United Kingdom, 1992.
Return to citation in text: [1]
Zeng, S.; Baillargeat, D.; Ho, H.-P.; Yong, K.-T. Chem. Soc. Rev. 2014, 43, 3426–3452. doi:10.1039/c3cs60479a
Return to citation in text: [1]
Hirst, G. K. J. Exp. Med. 1942, 75, 49–64. doi:10.1084/jem.75.1.49
Return to citation in text: [1]
Oesterhelt, F.; Rief, M.; Gaub, H. E. New J. Phys. 1999, 1, 6. doi:10.1088/1367-2630/1/1/006
Return to citation in text: [1]
Sauter, N. K.; Hanson, J. E.; Glick, G. D.; Brown, J. H.; Crowther, R. L.; Park, S. J.; Skehel, J. J.; Wiley, D. C. Biochemistry 1992, 31, 9609–9621. doi:10.1021/bi00155a013
Return to citation in text: [1]

© 2015 Liese and Netz; licensee Beilstein-Institut.
This is an Open Access article under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
The license is subject to the Beilstein Journal of Organic Chemistry terms and conditions: (http://www.beilstein-journals.org/bjoc)

All Thematic Issues All volumes

Article is part of the thematic issue

Multivalency as a chemical organization and action principle

Rainer Haag

Interesting articles

Photoswitchable glycoligands targeting Pseudomonas aeruginosa LecA

Yu Fan, Ahmed El Rhaz, Stéphane Maisonneuve, Emilie Gillon, Maha Fatthalla, Franck Le Bideau, Guillaume Laurent, Samir Messaoudi, Anne Imberty and Juan Xie

Switchable molecular tweezers: design and applications

Pablo Msellem, Maksym Dekthiarenko, Nihal Hadj Seyd and Guillaume Vives

Defining the hydrophobic interactions that drive competence stimulating peptide (CSP)-ComD binding in Streptococcus pneumoniae

Bimal Koirala, Robert A. Hillman, Erin K. Tiwold, Michael A. Bertucci and Yftah Tal-Gan

Other Beilstein-Institut Open Science Activities

[R1] Mammen, M.; Choi, S.-K.; Whitesides, G. M. Angew. Chem., Int. Ed. 1998, 37, 2754–2794. doi:10.1002/(SICI)1521-3773(19981102)37:20<2754::AID-ANIE2754>3.0.CO;2-3
Return to citation in text: [1] [2] [3]

[R2] Kiessling, L. L.; Young, T.; Mortell, K. H. Multivalency in Protein–Carbohydrate Recognition. In Glycoscience – Chemistry and Chemical Biology; Fraser-Reid, B. O.; Tatsuka, K.; Thiem, J., Eds.; Springer: Berlin, Germany, 2001; pp 1817–1861. doi:10.1007/978-3-642-56874-9_42
Return to citation in text: [1] [2]

[R3] Pieters, R. J. Org. Biomol. Chem. 2009, 7, 2013–2025. doi:10.1039/b901828j
Return to citation in text: [1]

[R4] Disney, M. D.; Zheng, J.; Swager, T. M.; Seeberger, P. H. J. Am. Chem. Soc. 2004, 126, 13343–13346. doi:10.1021/ja047936i
Return to citation in text: [1]

[R5] Wang, J.; Tian, S.; Petros, R. A.; Napier, M. E.; DeSimone, J. M. J. Am. Chem. Soc. 2010, 132, 11306–11313. doi:10.1021/ja1043177
Return to citation in text: [1]

[R6] Schaschke, N.; Matschiner, G.; Zettl, F.; Marquardt, U.; Bergner, A.; Bode, W.; Sommerhoff, C. P.; Moroder, L. Chem. Biol. 2001, 8, 313–327. doi:10.1016/S1074-5521(01)00011-4
Return to citation in text: [1]

[R7] Vance, D.; Shah, M.; Joshi, A.; Kane, R. S. Biotechnol. Bioeng. 2008, 101, 429–434. doi:10.1002/bit.22056
Return to citation in text: [1]

[R8] Martinez-Veracoechea, F. J.; Frenkel, D. Proc. Natl. Acad. Sci. U. S. A. 2011, 108, 10963–10968. doi:10.1073/pnas.1105351108
Return to citation in text: [1]

[R9] Hu, J.; Lipowsky, R.; Weikl, T. R. Proc. Natl. Acad. Sci. U. S. A. 2013, 110, 15283–15288. doi:10.1073/pnas.1305766110
Return to citation in text: [1]

[R10] Wang, S.; Dormidontova, E. E. Phys. Rev. Lett. 2012, 109, 238102. doi:10.1103/PhysRevLett.109.238102
Return to citation in text: [1]

[R11] Diestler, D. J.; Knapp, E. W. J. Phys. Chem. A 2010, 114, 5287–5304. doi:10.1021/jp100077n
Return to citation in text: [1] [2]

[R12] Weber, M.; Bujotzek, A.; Haag, R. J. Chem. Phys. 2012, 137, 054111. doi:10.1063/1.4739501
Return to citation in text: [1]

[R13] Huskens, J.; Mulder, A.; Auletta, T.; Nijhius, C. A.; Ludden, M. J. W.; Reinhoudt, D. N. J. Am. Chem. Soc. 2004, 126, 6784–6797. doi:10.1021/ja049085k
Return to citation in text: [1]

[R14] Varilly, P.; Angioletti-Uberti, S.; Mognetti, B. M.; Frenkel, D. J. Chem. Phys. 2012, 137, 094108. doi:10.1063/1.4748100
Return to citation in text: [1]

[R15] Angioletti-Uberti, S.; Varilly, P.; Mognetti, B. M.; Tkachenko, A. V.; Frenkel, D. J. Chem. Phys. 2013, 138, 021102. doi:10.1063/1.4775806
Return to citation in text: [1]

[R16] Pertici, F.; de Mol, N. J.; Kemmink, J. M.; Pieters, R. J. Chem. – Eur. J. 2013, 19, 16923–16927. doi:10.1002/chem.201303463
Return to citation in text: [1]

[R17] Mack, E. T.; Snyder, P. W.; Perez-Castillejos, R.; Bilgiçer, B.; Moustakes, D. T.; Butte, M. J.; Whitesides, G. M. J. Am. Chem. Soc. 2012, 134, 333–345. doi:10.1021/ja2073033
Return to citation in text: [1]

[R18] Shan, M.; Bujotzek, A.; Abendroth, F.; Wellner, A.; Gust, R.; Seitz, O.; Weber, M.; Haag, R. ChemBioChem 2011, 12, 2587–2598. doi:10.1002/cbic.201100529
Return to citation in text: [1]

[R19] Scheibe, C.; Bujotzek, A.; Dernedde, J.; Weber, M.; Seitz, O. Chem. Sci. 2011, 2, 770–775. doi:10.1039/c0sc00565g
Return to citation in text: [1]

[R20] Krishnamurthy, V. M.; Estroff, L. A.; Whitesides, G. M. Multivalency in Ligand Design. In Fragment-based Approaches in Drug Design; Jahnke, W.; Erlanson, D. A., Eds.; Wiley-VCH: Weinheim, Germany, 2006; pp 11–53. doi:10.1002/3527608761.ch2
Return to citation in text: [1] [2]

[R21] Samuel, J.; Sinha, S. Phys. Rev. E 2002, 66, 050801. doi:10.1103/PhysRevE.66.050801
Return to citation in text: [1]

[R22] MacKintosh, F. C.; Käs, J.; Janmey, P. A. Phys. Rev. Lett. 1995, 75, 4425–4428. doi:10.1103/PhysRevLett.75.4425
Return to citation in text: [1]

[R23] Kienberger, F.; Pastushenko, V. P.; Kada, G.; Gruber, H. J.; Riener, C.; Schindler, H.; Hinterdorfer, P. Single Mol. 2000, 1, 123–128. doi:10.1002/1438-5171(200006)1:2<123::AID-SIMO123>3.0.CO;2-3
Return to citation in text: [1]

[R24] Gargano, J. M.; Ngo, T.; Kim, J. Y.; Acheson, D. W. K.; Lees, W. J. J. Am. Chem. Soc. 2001, 123, 12909–12910. doi:10.1021/ja016305a
Return to citation in text: [1]

[R25] Shoichet, B. K. J. Med. Chem. 2006, 49, 7274. doi:10.1021/jm061103g
Return to citation in text: [1] [2]

[R26] Hulme, E. C. Receptor–Ligand Interactions A Practical Approach; Oxford University Press: Oxford, United Kingdom, 1992.
Return to citation in text: [1]

[R27] Zeng, S.; Baillargeat, D.; Ho, H.-P.; Yong, K.-T. Chem. Soc. Rev. 2014, 43, 3426–3452. doi:10.1039/c3cs60479a
Return to citation in text: [1]

[R28] Hirst, G. K. J. Exp. Med. 1942, 75, 49–64. doi:10.1084/jem.75.1.49
Return to citation in text: [1]

[R29] Oesterhelt, F.; Rief, M.; Gaub, H. E. New J. Phys. 1999, 1, 6. doi:10.1088/1367-2630/1/1/006
Return to citation in text: [1]

[R30] Sauter, N. K.; Hanson, J. E.; Glick, G. D.; Brown, J. H.; Crowther, R. L.; Park, S. J.; Skehel, J. J.; Wiley, D. C. Biochemistry 1992, 31, 9609–9621. doi:10.1021/bi00155a013
Return to citation in text: [1]

aromatic	the word “aromatic”
aromatic aldehyde	the word “aromatic” OR “aldehyde”
+aromatic +aldehyde	both words “aromatic” AND “aldehyde”
+aromatic -aldehyde	the word “aromatic” but NOT “aldehyde”
“aromatic aldehyde”	the exact phrase “aromatic aldehyde”
benz*	words which begin with “benz”, such as “benzene” or “benzyl”
benz*yl	words that begin with “benz” and end with “yl”, such as “benzyl” or “benzoyl”
benzyl~	words that are close to the word “benzyl”, such as “benzoyl” (i.e., fuzzy search)

1.	Mammen, M.; Choi, S.-K.; Whitesides, G. M. Angew. Chem., Int. Ed. 1998, 37, 2754–2794. doi:10.1002/(SICI)1521-3773(19981102)37:20<2754::AID-ANIE2754>3.0.CO;2-3
2.	Kiessling, L. L.; Young, T.; Mortell, K. H. Multivalency in Protein–Carbohydrate Recognition. In Glycoscience – Chemistry and Chemical Biology; Fraser-Reid, B. O.; Tatsuka, K.; Thiem, J., Eds.; Springer: Berlin, Germany, 2001; pp 1817–1861. doi:10.1007/978-3-642-56874-9_42
3.	Pieters, R. J. Org. Biomol. Chem. 2009, 7, 2013–2025. doi:10.1039/b901828j

8.	Martinez-Veracoechea, F. J.; Frenkel, D. Proc. Natl. Acad. Sci. U. S. A. 2011, 108, 10963–10968. doi:10.1073/pnas.1105351108
9.	Hu, J.; Lipowsky, R.; Weikl, T. R. Proc. Natl. Acad. Sci. U. S. A. 2013, 110, 15283–15288. doi:10.1073/pnas.1305766110
10.	Wang, S.; Dormidontova, E. E. Phys. Rev. Lett. 2012, 109, 238102. doi:10.1103/PhysRevLett.109.238102
11.	Diestler, D. J.; Knapp, E. W. J. Phys. Chem. A 2010, 114, 5287–5304. doi:10.1021/jp100077n
12.	Weber, M.; Bujotzek, A.; Haag, R. J. Chem. Phys. 2012, 137, 054111. doi:10.1063/1.4739501
13.	Huskens, J.; Mulder, A.; Auletta, T.; Nijhius, C. A.; Ludden, M. J. W.; Reinhoudt, D. N. J. Am. Chem. Soc. 2004, 126, 6784–6797. doi:10.1021/ja049085k

4.	Disney, M. D.; Zheng, J.; Swager, T. M.; Seeberger, P. H. J. Am. Chem. Soc. 2004, 126, 13343–13346. doi:10.1021/ja047936i
5.	Wang, J.; Tian, S.; Petros, R. A.; Napier, M. E.; DeSimone, J. M. J. Am. Chem. Soc. 2010, 132, 11306–11313. doi:10.1021/ja1043177
6.	Schaschke, N.; Matschiner, G.; Zettl, F.; Marquardt, U.; Bergner, A.; Bode, W.; Sommerhoff, C. P.; Moroder, L. Chem. Biol. 2001, 8, 313–327. doi:10.1016/S1074-5521(01)00011-4
7.	Vance, D.; Shah, M.; Joshi, A.; Kane, R. S. Biotechnol. Bioeng. 2008, 101, 429–434. doi:10.1002/bit.22056