Abstract:
A method to determine the type of knot formed by a compact
lattice chain is described. The compact lattice chain refers to a
lattice self-avoiding walk that visits ever lattice site within a cubic
region. The method is implemented in computer code as C++ and Java
programs. As a first measure of knot complexity, the average number of
crossings in the knot projection for a range of chain sizes is determined.
Next, the frequency of occurrence of the first few types of knots in the
compact lattice chains is measured. There is also particular interest
in isolating those chains that are trivial or unknotted. These chains
may be analyzed further to yield information such as the statistics of
the sub-chains.
Contents
I. Introduction
II. Method
A. Generating
Compact Lattice Chains
B. Generating
Plane Projection
C. Preprocessing
D. Computing
the Alexander Polynomial
E. Computing
the Vassiliev Invariants
III. Results
A. Average
Crossing Number
B. Knot/Unknot
Probabilities
C. Sub-chain
Statistics
IV. Conclusions
Acknowledgments
Appendix
A. Programs
and Source Code
References
Self-avoiding random walks, both on
and off lattice, are frequently used to model the spatial configuration
of a polymer chain. In a lattice random walk, the monomer units comprising
the polymer are represented by the lattice sites visited by the walk, while
the bonds are represented by the links between adjacent lattice sites.
A compact lattice chain is a self-avoiding
walk that visits every lattice site within a cubic region and forms a closed
loop in space (Figure 1a). This chain could
serve as a model of a compact loop of a single polymer (e.g. a globular
protein), or a 'sea' of polymer strands (e.g. a polymer melt).
Compact lattice chains are very amenable
to computer analysis. As such, it may be used effectively to explore
interesting properties of polymers, foremost being their knottedness or
topology. For example, it is found that DNA loops in gel-electrophoresis
experiments move faster the more complicated the knot type [1]. The
knottedness of compact lattice chains may also manifest itself on the statistics
of its sub-chains or segments. (These sub-chains could possibly model
polymers in a 'melt'.) For instance, the relationship between mean square
end-to-end distance (or some measure of size, such as gyration radius)
of a sub-chain and the number of steps comprising this sub-chain could
be studied to yield 'scaling relations' and 'critical exponents'.
A knot is a closed curve in space
which does not intersect itself. An intuitive grasp of what a knot
is should suffice to enable one to understand, and hopefully appreciate,
the ideas described in this paper. Ironically, it is this intuitive
appeal of knots that makes its study and associated computations not-so-trivial.
The necessary concepts will be introduced as we go along.
The primary mathematical tools employed
from knot theory to determine the knot type are the Alexander polynomial
invariant, a.k.a. D(t) [2], and the Vassiliev
invariant of degrees two (v2) and three (v3) [3]. The knot invariants
serve as a signature of the knot type. These invariants are computed
from a plane projection of the knot. Each invariant is guaranteed
not to change value or form as the knot is deformed without cutting or
letting the strands pass through each other (these 'destructive' operations
may change the knot type). A single knot invariant may correspond
to more than one knot type though (more about this later).
The compact lattice chains studied
in this project are generated with code developed by Alexander Borovinskiy
based on (and improving upon) reference [4]. The chains are specified
on a regular lattice as a sequence of non-negative integers (x y z).
As a rough measure of the complexity of the knots formed for each chain
size, the average number of crossings in the projection is determined.
The knot invariants for each chain are also determined. These lead to each
chain being associated with a particular knot type. The spatial coordinates
of chains with knot invariants corresponding to a trivial knot are collected
and analyzed further.
Starting from a compact lattice chain (Figure 1a), which we henceforth just call a chain, it is desired to obtain a set of numbers corresponding to the Alexander and Vassiliev invariants. I shall illustrate the process visually, but keep in mind that much of the steps are implemented on abstract program data structures. The following are the major steps (See also figure 1 a-d):
Each step is discussed in more detail below.
A. Generating
Compact Lattice Chains
A chain is generated using a method
called Hamiltonian path generation [4]. A chain is specified by giving
the non-negative integer coordinates of its nodes. The dimension
(LxLxL) of a closed chain may only be even. Thus, the chain lengths
explored in this study belong to the sequence N=L3=43=64,
216, 512, 1000, ... (no knots are formed for L=2). The generated chains are assumed to be 'fairly' sampled (ergodic).
B. Generating Plane
Projection
The knot invariants are usually evaluated
given a planar representation of the knot. This is obtained by projecting
the nodes (sites) and links of a chain onto a plane. The criteria
for a good projection are (1) the overpassing and underpassing strands
of the knot are clearly specified, and (2) only two strands may intersect
at a point on the projection. This may be achieved simply by looking
at figure 2a. The nodes are projected onto the diagonal of a horizontal
square. We may consider the x,y coordinates on the plane projection
of a node to be related to its x,y,z coordinates in space in the following
manner: (x,y,z) --> (x+z/L, y+z/L). Here L is the maximum coordinate
+ 1, or one more than the dimension of the bounding cubic region.
Note that this method is better than using rotation matrices which would
require explicit computation of sines and cosines.
Figure 2a. Projecting the nodes (circles) onto the horizontal
plane. Idea suggested by Alexander Borovinskiy.
Figure 2b. Projected nodes and links of figure 1a by the
method of figure 2a.
By referring to figure 2b, it is clear
that crossings (intersections) only occur between links that appear as
horizontal and vertical segments in the projection. As to which segment
in a crossing is an overpass, one need only check which segment corresponds
to nodes that have a larger z-coordinate.
Another important piece of information
associated with a crossing is its 'sign'. The sign represents the
relative orientation of the crossing segments. It is required in
specifying a knot and calculating the invariants. The sign is defined
by imagining taking a trip along the path defined by the knot, in the process
associating a direction or arrow to the knot. A +1 (-1) is
then assigned to the crossing in which the overpassing segment points to
the right (left) of the underpassing segment (Figure 3).
Figure 3. Sign of an intersection.
These three pieces of information - sequence of crossings, overpass/underpass, sign - suffice to specify a knot.
C.Preprocessing
Once a plane projection faithfully
representing the knottedness of a chain is obtained, one more practical
step is needed before the knot invariants are calculated. It is apparent
that there occur many crossings that are not essential in classifying a
knot. These crossings may be removed by intuitive operations on the
knot projection called Reidemeister moves (Figure 4a). Reidemeister
has shown that two knots are equivalent (the knot theory term is 'ambient
isotopic') if and only if one may be transformed into the other via a sequence
of Reidemeister moves. Unfortunately, there is no recipe for producing
such a sequence of moves in a quick and deterministic fashion.
In the absence of an algorithm to
minimize the number of crossings, the simplest thing to do is to scan the
knot for certain patterns and implement moves on these patterns that would
reduce (but not necessarily minimize) the number of crossings. These
moves are the first two Reidemeister moves plus that shown in Figure 4b.
Figure 4a. The three Reidemeister moves.
Move I is simple uncurling. Move II is removal of overlap.
Move III slides the vertical segment passed a crossing.
It is to be understood that one is working with only portions of a
knot
Figure 4b. Top drawing shows a combination of the third
and first Reidemeister moves.
The bottom drawing shows a loop underpassing an unspecified section
of the knot.
The reduction is necessary from a code
performance standpoint as the computation time of the knot invariants rises
quickly as the number of crossings in the knot projection increases (see
D and E). In the range of chain sizes explored, the moves reduce
the number of crossings to less than half (see figure 6). It is found
that the most effective in removing crossings is move II while the least
effective is move I+III (Figure 4b).
An upper bound for the number of crossings
for a given L may be calculated by referring to a diagram like figure 2b.
Letting each node be connected to its nearest neighbors, the exact number
of crossings comes out as L(L-1)3. In fact one may go
further and predict the average number of crossings. Assuming the
orientations of the links (of the 3-D chain) are equally distributed among
the three orthogonal directions, one may derive the following expression
(Expression 1)D. Computing
the Alexander Polynomial
What follows is a short discussion
of the general features of the knot invariants used in this study.
The detailed recipes for computing them are described very well in the
references.
The Alexander polynomial is a one-variable
polynomial (D(t)) with integer coefficients.
The polynomial corresponding to a knot does not change (within powers of
t) as the knot is deformed non-destructively in space, which is equivalent
to performing Reidemeister moves on the projection. Calculation of
the Alexander polynomial entails constructing a matrix, called the Alexander
matrix, and taking its determinant (although this is not the only way)
[2]. This matrix is very sparse. The only possible non-zero entries
are 1,-1,-t and t-1. The dimension of this matrix is identical to the number
of crossings (actually C-1) in the knot projection.
The Alexander polynomial is not a
'perfect' invariant in the sense that it fails to distinguish between certain
knots. Also the Alexander polynomial does not distinguish a knot
from its mirror image (A knot that is distinct from its mirror image is
called 'chiral', otherwise it is 'achiral'.).
For simplicity, the Alexander polynomial
is evaluated at a special value of the variable (D(t=-1)).
This restricts the elements of the Alexander matrix to a small set of integer
values. The correct way to evaluate the Alexander polynomial is then
to evaluate a determinant using only integer operations. For an algorithm
such as Gaussian elimination, the computation time (really the number of
multiplications) should scale as the third power of the matrix dimension
(C-1).
E. Computing
the Vassiliev Invariants
The Vassiliev invariants are
calculated in a combinatorial fashion starting from a Gauss code representing
the knot [3]. For example, a Gauss code for a left-handed trefoil
is b-1,a-2,b-3,a-1,b-2,a-3. The group of three symbols separated
by commas represents a crossing. The letter 'b' (for 'below', and
'a' for 'above') indicates the segment is an underpass (overpass).
The second symbol is the sign mentioned earlier. The third symbol
is a label for the crossing. The label is required in order to know
when the same crossing is going to be traversed a second time (note that
a crossing is encountered twice when making a round trip along the knot
- once as an underpass and once as an overpass). The Gauss code may
be represented pictorially by a chord diagram (Figure 5).
Figure 5. Chord diagram for a knot (trefoil) with Gauss
code b-1,a-2,b-3,a-1,b-2,a-3.
The chords represent the crossings. The arrow head (tail) indicates
an underpass (overpass).
The circle (knot) is traversed along the direction indicated by the
arrow.
The computation of the Vassiliev invariants
is essentially a simple pattern recognition task on the symbols of the
Gauss code. Certain patterns of overpasses and underpasses (which
may be depicted in a diagram like figure 5) are assigned weights and the
contributions of all matching patterns in the knot are added together to
yield a number corresponding to each Vassiliev invariant. The Vassiliev
invariant of degree two (v2) requires a simpler pattern to search for than
that for the Vassiliev invariant of degree three (v3). This implies
that v2 takes much shorter time to evaluate than v3.
The Vassiliev invariants are coefficients
of a certain expansion of a polynomial similar to the Alexander polynomial,
called the Jones polynomial. This polynomial is still not 'perfect',
but in contrast to the Alexander polynomial, the more powerful Jones polynomial
does distinguish between mirror images. Of the two Vassiliev invariants
used, the degree three invariant gives opposite signs for mirror images.
One may deduce from this property that an achiral knot has v3 equal to
zero (See table 1 below).
The Arf invariant, which takes only
values 0 or 1, is obtained somewhat as a byproduct of the calculation of
the Vassiliev invariant of degree two.
Once the knot invariants for a compact
lattice chain are determined, that chain may be assigned a knot type.
For example, the knot invariants for the 4x4x4 chain in figure 1a have
invariants |D(-1)|=3,v2=1,|v3|=1. These
values are identical to the (pre)computed values for a trefoil. So
the chain in figure 1a is classified as a trefoil.
In general, there is as yet no complete
set of knot invariants that could unambiguously identify a knot.
However, by taking the invariants D(-1), v2,
v3 together, the probability of incorrectly classifying a knot is
much reduced. For instance, the set of numbers corresponding to a
trivial knot (D(-1)=1,v2=0,v3=0) are unique
among 'prime' knots with ten or fewer crossings in their projection (249
in number). Also, no knot has yet been found that has the same Jones
polynomial as the trivial knot.
III. Results
A. Average Crossing
Number
The data for average crossing number
versus chain length are shown in figure 6 together with the prediction
(Expression 1). For each chain, the knot
projections onto each of the orthogonal planes (xy,zx,yz) were obtained
and the number of crossings determined for each projection. Each
chain therefore contributed three projections towards computing the average
crossing number. As an aside, the projection (xy,zx or yz) that gives
the least number of crossings (after reduction) is chosen when the time
comes to compute the knot invariants.
Figure 6. Average crossing number versus compact lattice
chain length (N=L3),
plotted in a log-log scale. Prediction and linear fit also shown.
The results for average crossing number is significant in two ways. First, the average crossing number may provide a rough measure of knot complexity for a given chain size. Second, for large L, the expression for the average crossing number (as well as for the upper bound) becomes L4=N(4/3) to within a constant factor. This expression is reminiscent of the 'four-thirds power' law relating average crossing number to 'rope length' for tight knots [5,6].
B. Knot/Unknot Probabilities
The probabilities of occurrence of
the trivial knot (unknot), 3-1 (trefoil), 4-1 (figure-eight), 5-1 (star)
and 5-2 versus the chain length (N=L3 and L=4,6,8,10,12)
are shown together with their knot diagrams (Figures 7,8,9 below).
This list is ordered according to increasing velocity in gel-electrophoresis
experiments (A rationale for this ordering could be that more complicated
knots tend to be more compact and therefore move faster in solution).
These graphs are similar to those of studies made with other chain models
[7]. Notice (Figure 9) that between knots with five crossings
in the projection, 5-2 occurs more often than 5-1 (perhaps because 5-2
looks more asymmetric?). This particular result also agrees with
studies done with other chain models, such as that of [2]. The barely
visible error bars are the computed standard deviations assuming the occurrence
of a particular knot is binomially distributed.
Figure 7. Log-linear plot for probability of occurrence
of trivial knots in compact lattice chains,
together with linear fit (actually exponential but appears as a line
in the plot).
This behaviour is expected as knot complexity increases (trivial knots
become more rare) with chain size.
Left-handed trefoil (3-1). Distinct from the right-handed trefoil
(its mirror image).
Figure eight. Identical to its mirror image (i.e. it is 'achiral').
Star (5-1).
5-2. This knot travels faster than 5-1 in gel-electrophoresis
experiments.
Figure 8. Diagrams for the first four knots. The
trivial knot (unknot) is equivalent to a circle.
Figure 9. Probability of occurrence of the first four
'prime' knots.
| KNOT | Alexander, |D(-1)| | Vassiliev, v2 | |v3| | CHIRAL? |
| Trivial | 1 | 0 | 0 | NO |
| Trefoil, 3-1 | 3 | 1 | 1 | YES |
| Figure eight, 4-1 | 5 | -1 | 0 | NO |
| Star, 5-1 | 5 | 3 | 5 | YES |
| 5-2 | 7 | 2 | 3 | YES |
The present study does not distinguish
between a knot and its mirror image. This means the Vassiliev invariant
of degree three (which flips sign when a mirror image is taken) is determined
and compared only in magnitude.
For the purpose of efficiently counting
how many times the five simple knots occur, the program computed the invariants
in increasing order of computation time. So the invariant that takes
the least amount of time, v2, is computed first. If the resulting
number corresponds to one of the five knots, the next invariant is computed,
D(-1).
If this again corresponds to one of the five knots under study, the slowest-
to-compute invariant, v3, is evaluated last.
C. Sub-chain Statistics
The measurements for the mean-square
end-to-end distance and gyration radius for a range of sub-chain lengths
are shown in figures 10a,b for compact lattice chains of size 12x12x12.
Both indicators of size give a smaller radius for the sub-chains of trivial
knots. We may interpret the results by saying that trivial knots
are less entangled and therefore portions of it tend to form smaller clumps.
These results are encouraging as it reinforces our belief that the method
of knot invariants do indeed give us the correct knot type.
Figure 10a. Measurements of mean square end-to-end distance
versus sub-chain length
made from 109 trivial knots and 187 non-trivial knots, plotted in a
log-log scale.
Figure 10b. Log-log plot of mean square gyration radius
versus sub-chain length.
A formula for the square of gyration radius is
where the r's are radius vectors of the nodes and M is the number of
nodes in the sub-chain.
Figure 10c. Log-log plot of mean square end-to-end versus
sub-chain length (from 10 to 100) with linear fit.
The choice of compact lattice chain
size (12x12x12) was guided by the desire to have the largest possible chain
and not-too-slow program execution time. It took about one day for
a single program to generate and analyze 2500 chains on a supercomputer
(e.g. IBM-SP at the Minnesota Supercomputing Institute) to yield on average
5 trivial knots. The program may still be optimized, in particular
if native math software libraries on the supercomputer are utilized (e.g.
IBM's ESSL).
The range of sub-chain lengths explored
was motivated by the desire to model a linear polymer immersed in a large
sea of other polymers (e.g. a polymer melt). The number of steps
in the sub-chain beyond which the effects of the cube boundaries are 'felt'
is estimated to be about 122=144, as in a random walk.
Indeed, figure 10a shows that the curves begin to flatten at around M=150,
indicating that the boundaries have been encountered by the sub-chains.
IV. Conclusions
A method to analyze the knottedness
of compact lattice chains was described and implemented. The results
for average crossing number and knotting/unknotting probabilities are reminiscent
of those of other authors. In particular, the average crossing number
is related to the chain length via a four-thirds power for large chain
sizes. Also, the unknotting probability drops roughly exponentially
versus chain length. Analysis of sub-chain sizes indicate a difference
between trivial knots and non-trivial knots. There is a range of
sub-chain lengths where sub-chains of trivial knots are smaller on average
than sub-chains of non-trivial knots.
Acknowledgments
Thanks to Professor Alexander Grosberg
for his guidance in this project. Thanks also to Alexander Borovinskiy
for useful discussions.
Some of the images (DNA photo, the
knot diagrams in figure 8) were culled from the web.
Appendix
A. Programs
and Source Code
The methods described in this paper and in the references
were implemented in several computer languages and platforms, most notably
in C++ and Java. The C++ implementation of the knot analysis code
does the real work; it is more comprehensive and couples easily with the
loop/chain generation routines of my colleagues (Alexander Borovinskiy
and Nathan Moore). The Java program is primarily for prototyping
and visualization purposes.
The link to programs and source code is http://www.physics.umn.edu/~rlua/knots.
References
[1] G. Buck and J. Simon, "Energy and length of knots",
Lectures at Knots96 (Ed. S. Suzuki), World Scientific Publ., 1997, 219-234.
[2] A.V. Vologodskii, A.V. Lukashin, M.D. Frank-Kamenetskii,
and V.V. Anshelevich, Zh. Eksp. Fiz. 66, 2153 (1974) [Sov. Phys. JETP 39,
1059 (1974)]
[3] M. Polyak and O. Viro, Gauss Diagram Formulas for
Vassiliev Invariants, preprint.
[4] R. Ramakrishnan, J.F. Pekny, and J.M. Caruthers,
J. Chem. Phys. 103, 7592 (1995).
[5] J. Cantarella, R.B. Kusner and J.M. Sullivan, Tight
Knot Values Deviate from Linear Relations, Nature, 392 (1998) pp. 237-238.
[6] G. Buck, Four-thirds Power Law for Knots and Links,
Nature, 392 (1998), pp. 238-239.
[7] T. Deguchi and K. Tsurusaki, "Random Knots and Links
and Applications to Polymer Physics", Lectures at Knots96 (Ed. S. Suzuki),
World Scientific Publ., 1997, 95-122.