diff options
author | sanine <sanine.not@pm.me> | 2022-01-24 19:34:28 -0600 |
---|---|---|
committer | sanine <sanine.not@pm.me> | 2022-01-24 19:34:28 -0600 |
commit | ec20b587362d76d6c48ecc1a5c1e65f1bb9293da (patch) | |
tree | e6739fc0925b007eee136a1b90a5d3d0abba8494 |
initial commit
-rw-r--r-- | reference/sphere.faq | 761 | ||||
-rw-r--r-- | reference/sphere.rand | 157 |
2 files changed, 918 insertions, 0 deletions
diff --git a/reference/sphere.faq b/reference/sphere.faq new file mode 100644 index 0000000..cc60ca5 --- /dev/null +++ b/reference/sphere.faq @@ -0,0 +1,761 @@ + TOPICS ON SPHERE DISTRIBUTIONS + +[Latest version of 1998/02/26 by Dave Rusin, rusin@math.niu.edu. +This file is available by FTP (or gopher) at + ftp://math.niu.edu/pub/papers/Rusin/known-math/95/sphere.faq +but I recommend instead this URL which gives links to some related areas: + http://www.math.niu.edu/~rusin/known-math/index/spheres.html +If you're reading this, send me your comments and suggestions.] + +I'm writing this FAQ because of the frequency with which topics related +to this subject appear in the USENET newsgroups such as sci.math. This +isn't really my main area of interest so some of the comments are +pretty simple-minded; if you are a power user in need of guidance, you +should look elsewhere for suggestions, while conversely if you can +offer expert guidance I'd like to incorporate it here. Some of the +material already here is lifted from other sources; whenever possible +I try to assign the blame^H^H^H^H^H credit to someone else. + +The overall main topic is the question of how to place points on the +sphere, or how to divide up the sphere into regions, in a regular way. +Along the way we'll talk about map-making, Platonic solids, and so on. +The following questions will be addressed (other suggestions welcome): + +Q0. Notation +Q1. What does it mean to place N points "evenly" on a sphere? +Q2. Are there some values of N which are better than others? +Q3. How can you describe locations on a sphere, anyway? +Q4. Can you give a quick approximation to a good distribution? (*) +Q5. How can we improve an approximate solution? (*) +Q6. What if I want randomly to place points with a uniform distribution? +Q7. Can this be generalized to higher-dimensional spheres? +Q8. How is this related to collections of linear subspaces? +Q9. Where can I get the coordinates of the vertices of the Platonic solids? +Q10. How come this sounds like sphere packing? + +(*) Fetch the accompanying file sphere.bas for a BASIC program demonstrating +these techniques. + +Other topics possibly to include in the future: + What's the formula for the volume of the n-ball? n-sphere? + What's this I hear about exotic differentiable structures on spheres? + Can you describe projective space and its relation to spheres? + What are sources for further information? +If you need some information about these topics, let me know; that will be +my clue that it's time to add something to the FAQ. + +At some point I may separate this into a 2-sphere FAQ and a hi-dimensional- +sphere FAQ. If this seems important, let me know and I'll move it up my +priority list. + +dave + +============================================================================== + ANSWERS TO THE FREQUENTLY-ASKED QUESTIONS +============================================================================== + +Q0. Notation +Just in case there is some confusion I'll informally make the definitions +I'll need later. + EUCLIDEAN SPACE (or n-space, or R^n) is the set of n-tuples of +real numbers. We will use the ordinary metric (distance function). Note +that we commonly speak of ourselves as living in 3-space. + The SPHERE of radius r in R^n is the set of points whose +coordinates satisfy x1^2+...+xn^2=r^2; the unit sphere is the sphere of +radius 1. Note that I am explicitly assuming all spheres are centered +at the origin. The interior of the sphere is called the BALL in R^n; +it is the set of points of distance less than r from the origin. (The +ball and the sphere are disjoint; their union is sometimes called the +n-DISK .) If not otherwise specified we are usually referring to the sphere or +ball in R^3. + The DIMENSION of a (reasonable) subset of R^n is the number of +independent directions one can travel in the set. For example, in the +ordinary sphere motion at most points is a combination of movements +East and North, so the sphere in R^3 is two-dimensional (not three). +The ball in R^n is n-dimensional. + Note that the circle is the sphere (of dimension 1) in R^2; +the sphere in R^1 is two points (+1 and -1). We often write S^n for the +n-dimensional sphere (in R^(n+1).) + A LINE is the set of multiples of a point or vector (other +than the origin). Note in particular that I am referring only to lines +thru the origin. A PLANE is the set of linear combinations of two +non-collinear vectors. (Don't call it a SURFACE ; that's usually +understood to be anything 2-dimensional, including planes, the +2-sphere, paraboloids, and so on.) Likewise we can take any k +linearly independent vectors in R^n and look at all linear +combinations of them; this set is called a LINEAR SUBSPACE of R^n and +is of dimension k. When k=n-1, the set is called a HYPERPLANE. + + The focus of this document is on S^2 but from time to time I +consider greater generality when it is easy to do so. + +============================================================================== + +Q1. What does it mean to place N points "evenly" on a sphere? + +Well, what do you want it to mean? Several valid interpretations exist; +they lead to slightly different distributions of points. + +If you want all distances between distinct points in the set to be equal, +you can have at most 4 points. Indeed, it's easy to see any 3 +equally spaced points mark an equilateral triangle and any 4 must then +mark a regular triangular pyramid. You can have a 5th point equidistant +from the first 3 (glue two such pyramids face to face) but then the +4th and 5th points are too far apart. + +But everyone agrees that the vertices of a regular polygon are equally +spaced in S^1, so there must be another definition we have in +mind. One can ask that every point have some property, such as being +equidistant from its nearest neighbor. Such distributions exist -- for +example, one can space the points equally along the equator of the +sphere. Still, this is not likely to be what we have in mind when we +ask for a uniform distribution of the points. + +One might ask that the points be arranged "symmetrically" on the sphere in +some way; in Q3 we'll discuss how to do this, but true symmetry is available +only for a few values of N. + +It is more helpful to measure uniformity not point-by-point but by considering +some aggregate measure of the dispersion. For example, we could let D be +the minimum distance between any pair of distinct points in the set. This +provides a measurement of how good or poor a distribution we've chosen. +For example, four points equally spaced on the equator have D=sqrt(2)=1.41... +whereas four points on the pyramid have a larger D=(2/3)sqrt(6)=1.63... +So we might ask for an arrangement on the sphere which maximizes D. + +In general this statement of the problem guarantees a solution exists. +Let D be any continuous function of N points on the sphere; then since +the sphere is compact, so is the Cartesian product of N copies of it, +so that D is a continuous function on a compact set, and so achieves a +maximum and a minimum. If D is chosen intelligently, the values of the +variables that lead to an extreme value of D will give an arrangement of +points which is arguably "uniform". + +There are some drawbacks with this perspective. For one, the extreme +values of D will be attained more than once. (Certainly if D depends +symmetrically on the points, any permutation of the N variables will +provide another occurrence of the extreme value. Indeed, it is +reasonable to consider only functions defined on the symmetric product +S^N(S) =(SxSx...xS)/Sym_n rather than on the Cartesian product +(SxSx...xS) of N copies of the sphere S) If D is invariant under +rotations of the sphere, which is reasonable, then in order to hope to +have an isolated extreme configuration, one needs to hold one point +fixed at, say, the North pole, and to allow another point to vary only +along a meridian attached to the pole. + +Another obstacle is that there are several natural choices for D, and an +arrangement of points which is optimal for one such function need not be +optimal for another. Here are some natural choices for D: + D1=min_i min_{j<>i} dist( Pi, Pj ) (the function considered above) + D2=(1/N) Sum_i min_{j<>i} dist( Pi, Pj ) ("average dist to a neighbor") + D3=Sum_i Sum_{i<>j} 1/dist( Pi, Pj ) ("potential energy") +If an electron is placed at each of the points, D3 is the sum of the +electrostatic potentials at these N electrons; they would tend to spread +apart if possible so as to minimize D3. + +Finally, there is the question of how one would compute an optimal +configuration once a decision is made of how optimality would be measured. +If D is a differentiable function of the coordinates of the points, then +the problem can be solved with Lagrange multipliers, but this is hopelessly +cumbersome for any but the smallest values of N and the simplest +functions D. Moreover, some of the most natural functions D to take are +not differentiable everywhere. We shall return to this in Q5. +============================================================================== + +Q2. Are there some values of N which are better than others? + +You bet! As has been pointed out above, values of N <= 4 have very natural +solutions. In addition, there are solutions for some N related to the +Platonic solids. Here's why. + +Suppose we had an arrangement of N points with the following property: +given any two points there is a rotation of the sphere which + (1) carries the one to the other and in the process + (2) carries each of the points in the distribution to some other point + in the set (possibly itself). +These rotations generate a group (a subgroup of SO(3) ) of +permutations of the N points; since the only rotation of the sphere +which fixes two points not antipodal to each other is the identity +(the trivial rotation), the group of all these rotations is finite +(unless N=2). + +Well, as it happens, there are not many finite subgroups of SO(3). + +First, there are all the cyclic subgroups which are sets of rotations +with a common axis, as well as the dihedral groups which adjoin one +rotation about an axis perpendicular to this one. If this is the +symmetry group of your arrangement of points, then all your points +lie on one circle (two circles, if you have a dihedral group and a +non-equatorial arrangement). + +The remaining finite subgroups of SO(3) are the symmetry group of the +pyramid (isomorphic to Alt(4), it has order 12), the symmetry group of +the cube (it's the full symmetric group of order 24, permuting the 4 +diagonals) and the symmetry group of the dodecahedron (the symmetric +group of order 120). The octahedron is dual to the cube and the +icosahedron is dual to the dodecahedron, while the tetrahedron +(pyramid) is self-dual. (Here "dual" means vertices of one are +midpoints of faces of the other, so dual polyhedra have the same +symmetry group.) (For a full analysis of the Platonic solids, see +Coxeter, "Introduction to Geometry", Wiley (NY) 1969.) + +It's not quite true that a set of points permuted by one of these three +groups has to be the set of vertices of a Platonic solid (as a glance +at a soccer ball [non-US: football] will show). However, a firm +insistence that there be sufficient rotations to carry each vertex +to any other limits the number of points to 120. + +So there are only finitely many non-circular arrangements with this high +degree of symmetry. Of course, you could relax the condition that there be +rotations preserving the set of points which permute the points +transitively (any point can be carried to any other one). For example, you +could begin with one of the Platonic solids and mark off, say, the +midpoint of each edge as well as each of the vertices. (The former need +to be scaled of course to make them lie on the sphere.) There is a large +collection of figures that exhibit some partial symmetry. In general, the +more symmetry you require, the fewer the number of values of N which +you can expect to place so symmetrically. + +If you are working with spheres of a higher dimension, there are more +finite groups of rotations; the orbits of a point under these groups will +give sets of points which could reasonably be called "uniform". Indeed, +given any set of points on a sphere (or any other metric space) you can +define the Voronoi region of each point to be the set of elements in the +sphere closer to that point than to any other. Then if the rotation group +acts transitively on the points, it provides enough motions to show that +all the Voronoi regions are congruent. A couple of theorems are relevant +here, I guess: first, given any group G there is a group of rotations +in SO(n) isomorphic to G as long as n is sufficiently large. Second, +given any n there may be infinitely many finite subgroups in SO(n) +(as the cyclic subgroups of SO(3) demonstrate) but there are only finitely +many quotient groups G / Z(G); that is, the G will have an (Abelian) +center of bounded index. + +Departing from these geometric considerations, we will see in Q4 that +some approximate constructions of uniform arrangements are a little +easier for some N's than others, so if you have some freedom in the +selection of N you can make your job a little easier. + +============================================================================== + +Q3. How can you describe locations on a sphere, anyway? + +Clearly any point on a sphere also sits in the ambient Euclidean space, +and so may be specified by its ordinary Cartesian coordinates. But these +coordinates are bound by the equation of the sphere, and it is sometimes +preferable to use coordinates which are unbound. + +Since the sphere in R^n is (n-1)-dimensional, it is natural to try to +parameterize your position with just n-1 variables. This is well known +on the ordinary 2-sphere, where we use latitude and longitude. Such +parameterizations are possible, although there are some nettlesome points +to keep in mind. + +Ideally, a parameterization of the n-sphere would be a function P +defined on a subset of Euclidean space R^n and taking values in the +n-sphere, having the following properties: + 1) P is continuous (i.e. a continuous change in R^n should yield + a continuous change on the sphere. Equivalently, P + should be given by (n+1) continuous coordinate functions + whose squares sum to 1. ) + 2) P is one-to-one: a point on the sphere shouldn't show up more + than once during the parameterization. + 3) P has a continuous inverse F, that is, a coordinate function + which determines for each point in the sphere what values + of the parameters are needed to map to that point. + The existence of such an F implies P would be onto. + 4) P (and/or F) preserves important geometric information like + angles or areas (or the n-dimensional equivalents). + 5) P (and/or F) is differentiable. +Unfortunately, for topological reasons there can be no function P +meeting even just (1) (2) and (3). This is well-known to map-makers +trying the case n=2: any map of the earth's surface has to rip the +surface somewhere. + +However, there are functions which meet some of these conditions; various ones +are appropriate under different conditions. + +1. If you only want to parameterize _part_ of the sphere, you +can get away with a simple projection: use F(x1,...,x{n+1})=(x1,...,xn) +and P(x1,...,xn)=(x1,...,xn, sqrt(1-x1^2-...-xn^2) ). Use the negative +square root to parameterize the bottom half of the sphere. If you need +to have a parameterization valid a neighborhood of some point on the +equator, have F drop out some other coordinate (which P puts +back in with a similar square root formula). + +2. You can get a correspondence P mapping onto all of the sphere +except one point (say, the North pole "NP"); indeed you can define +P : R^n --> S-{NP} and an inverse map F going the other way which are +continuous, differentiable, one-to-one, and onto. Do this with +stereographic projection: place the plane thru the equator of the +sphere and draw rays emanating from the North Pole and pointing +downward some non-zero amount. Each such ray intersects both the +sphere and the plane in one point, and all the points in S-{NP} and +R^n are so met. These rays then set up the one-to-one +correspondence. The formulas are + P(x1,...,xn) = t.(x1,...,xn,0) + (1-t)(0,...,0,1) + F(x1,...,x{n+1})= r . (x1, ..., xn) +(where (0,...,0,1) is the North Pole on the n-sphere in R^{n+1}.) Here +t is chosen to make the distance from the origin be 1 : t=2/(1+x1^2+...+xn^2). +Also r is chosen to invert the first formula: r=1/(1-x{n+1}) + +These formulas are the basis of the polar projection used by map-makers. + +3. If it is necessary to have a parameterization which preserves area +(or its n-dimensional equivalent), this coordinate system F can be +modified by adjusting the function r=r(x{n+1}). + +Here's how we decide what the right function r is. Consider the set of +points in S^n with x{n+1}=a. This is an (n-1) sphere with radius +h=sqrt(1-a^2). Then for small e the set of points in S^n with +x{n+1} lying between a and a+e is a slightly bent version of the +product of an (n-1)-sphere of radius h and an interval of length +e.sqrt[1+(dh/da)^2] = e/sqrt(1-a^2). The function F above carries it +to another such product, but this time the radius is z(a)=r(a)h(a) and +the length of the interval is about e.z'(a). Since F changed the radius +by a factor z(a)/h, the (n-1)-volume will change by a factor +of [z(a)/h]^(n-1). If we want to keep the n-volume unchanged, we need +the ratio of the lengths of the interval, roughly z'(a) sqrt(1-a^2), to +be the reciprocal of this, that is, we need z'(a).z(a)^(n-1) = +h^(n-1)/sqrt(1-a^2) = sqrt(1-a^2)^(n-2). Solving this differential +equation shows we need + r(a) = z(a)/h(a) = [1/sqrt(1-a^2)].[n.integral(1-a^2)^(n/2-1) da]^(1/n) +(As z(-1)=0, the integral should be selected to vanish at a=-1.) + +So r(x)=[1/sqrt(1-x^2)].[n. integral_{-1}^{x} (1-a^2)^(n/2 - 1) da]^(1/n). +For n=2, this gives r(a)=sqrt(2/(1-a)), i.e. z(a)=sqrt(2*(1+a)). In +particular, this will map the whole sphere to the disk of radius 2, +whose area is then 4pi (the area of the sphere) as it should be. + +4. There are also trigonometric parameterizations. +For example, it is easy to parameterize the 1-sphere (the circle) with a +parameter t using P(t) = (cos(t), sin(t)). Here if we really want P +to be one-to-one we need to restrict to an interval such as [0, 2 pi). +This is then indeed one-to-one, onto, and continuous (even differentiable) +and as a bonus preserves lengths. But there can be no continuous inverse +function. F(x,y)=arctan(y/x) will do if branches of the arctan function +are chosen appropriately, but no set of choices can be made consistently +in all quadrants of S^1. + +Still, this parameterization is useful, and admits generalization. The +function + P(u,v) = (cos(u)cos(v), cos(u)sin(v), sin(u)) +is also differentiable, maps onto S^2, and is one-to-one on the set +(-pi/2,pi/2)x[0,2 pi). (The image of P on this set misses the poles, +however.) This P has an inverse in the sense we had on the circle: +(u,v)=F(x,y,z)=(arctan(z/sqrt(x^2+y^2)), arctan(y/x) ). + This is the ordinary parameterization of the sphere in terms of +latitude (u) and longitude (v). By computing the Jacobian for the change of +variable to spherical coordinates, we see that this parameterization +changes areas by a factor of |cos(u)|. + +This may be generalized to the n-sphere: use the function P_n, where +P_n(u1,...,un) = ( cos(un)*P_{n-1}(u1,...,u{n-1}) , sin(un) ) +(for each un in (-pi/2, pi/2), the parameterization traces out an +(n-1)-sphere using P_(n-1). ); n-dimensional volume is reduced by a +factor of | cos(u2) . cos(u3)^2 . ... . cos(un)^(n-1) |. +(I'll discuss another interpretation of this product in Q6.) + +These parameterizations form the basis of various cylindrical maps of +the earth (rectangular maps excluding the poles in which the left and +right edges represent the same points on earth). The Mercator projection +is chosen to remove the distortion of angles; thus (locally) the +Mercator projection gives an accurate representation of shapes. But the +correction used is precisely the inverse of what is needed to correct +the |cos(u)| change in area, so that the Mercator projection is wildly +inaccurate at portraying areas (especially near the poles). I believe +the 'Peters projection' is the name of the map which corrects areas (but +which amplifies the distortions of angles, and thus of shapes). + +5. If the trigonometric expressions are too computation-intensive, +one can use a parameterization requiring only rational functions. Note first +that the circle may be parameterized using a variation of stereographic +projection: + P(t) = ( -2t/(1+t^2) , (1-t^2)/(1+t^2) ) + F(x,y)= (y-1) /x +Thus, we can replace all sine/cosine pairs with these two functions. +For example, the 2-sphere may be parameterized: +P(u,v)=( 4uv/[(1+u^2)(1+v^2)], -2u(1-v^2)/[(1+u^2)(1+v^2)], (1-u^2)/(1+u^2) ) +F(x,y,z)= ( (z-1)/sqrt(x^2+y^2), (y-1)/x ) + +For further details, consult an advanced calculus text; look up the +change-of-variables theorem, particularly spherical coordinates and +the role of the Jacobian (derivative) in determining areas. +[I'll add a reference to the mathematics of map-making as soon as I find one.] + +============================================================================== + +Q4. Can you give a quick approximation to a good distribution? + +Of course. Mark off K equal segments along a meridian using K-1 +points plus the two endpoints. These segments have length pi/K. If you +swing this around the sphere, you'll mark off K-1 circles. Now, in +spherical coordinates, these circles are the points with +ui=-pi/2 +i*(pi/K), where i = 1, 2, ..., K-1. They have radius cos(ui), +hence length 2pi*cos(ui). Thus you can divide them into about +[2pi*cos(ui)]/[pi/K] segments as long as the segments on the meridian. +So take Mi to be an integer near 2*K*cos( -pi/2 + i*(pi/K)) = +2*K*sin( i * (pi/K) ), and mark off points that have u=ui and +v=j*(2*pi/Mi) for j = 1, 2, ..., Mi. This will give you +2+Sum{i=1,...,K-1} M_i points on the sphere. + +To within an error less than K-1, this is the same as +2+Sum 2*K*sin( i* pi/K ) = 2 + 2*K*cot( pi / 2K ). For large values of K, +this is about (4/pi) K^2. (With an area of 4pi on the whole sphere, +this leaves an area of (pi/K)^2 around each point, which is not +surprising since initially the points are about pi/K apart on a +roughly square grid). + +If you have the luxury of placing _approximately_ N points on the +sphere, begin with K an integer roughly equal to sqrt((pi/4)N), and +compute the number of points which would be placed with the method +above; you should get about N points. Here are the values I get +for the first few K using round() to convert to integers when necessary: +for K=1, N=2; for K=2, N=6; then for subsequent K (thru K=22), I get N= +12,22,34,46,82,106,128,156,184,214,248,288,328,368,412,460,510,562,616,... + +Notice that the first layer around the north pole has about 2Ksin(pi/K) +points. For K large, this is about 2*pi = 6.28, so we would take M1=6, +and arrange the first circle of points in the "kissing pennies" pattern. + +This arrangement of points on the sphere has an obvious regularity, +but the points are not spaced as far apart as possible by any reasonable +measure -- for example, roughly sqrt(K/5) of the rows nearest the +equator will have about the same number of points in them, roughly in +a square-looking grid; clearly a honeycomb distribution will place the +same number of points with a greater spacing. This leads to question Q5. + +I have included as a separate file a BASIC program which carries out this +distribution. In view of the previous paragraph, this code can be +improved when N is large, but I'm not really sure of an optimal way to +do so, so I won't. + +============================================================================== + +Q5. How can we improve an approximate solution? + +If the approximate solution in Q4 is insufficient, you'll have to decide why. +This returns you to the discussion on Q1. Let me assume that there is some +function D of the positions of the points which you wish to minimize +or maximize. (Replacing D by -D as necessary we may assume D is to be +maximized.) + +The method outlined in Q4 should give a low value of D. If D is +differentiable, you can use the method of gradients: you adjust the positions +of the points in the direction of the gradient. Professionals in +numerical analysis should scoff at this naive approach -- for one thing, +this method assumes that the critical point is an isolated minimum; also, +it is not clear just how far in the direction of the gradient to +proceed so as not to overshoot the region of convergence; moreover, once +in the region of convergence, the rate of convergence can be quite slow. +However, in practice I have found that if you have a moderate number of +points to place and you are not too demanding in your search for an +absolute minimum, this method works quickly enough if you begin with a +moderately uniform distribution of points. (I would appreciate learning +of improvements from numerical analysts.) + +Here is what the method of gradients means in more detail. If you +have parameterized the sphere and expressed D as a function of N sets +of the parameters, compute the partial derivatives of D with respect +to all these variables, and then evaluate at the present values of the +parameters. Add to each of the parameters a small multiple of this +partial derivative. + +If instead you have calculated D as a function of the Cartesian +coordinates of the N points, you need to take the portion of the +gradient which points in a direction tangent to the sphere. In +practice, this can be accomplished by adjusting all coordinates of the +points as in the preceding paragraph, then rescaling each point's +coordinates to ensure the vector is of length one. + +There is the possibility of creep to be concerned with. That is, it is +possible that with each iteration the function D is not improved; +rather, the points have simply all rotated somewhat around the sphere. +Under the assumption that D is invariant under rotations, there is no +need to allow this kind of motion. One can negate this possibility by +rotating the sphere after each iteration so that one point (say the +North pole) is returned to its previous position and another point +is returned to the meridian it was on previously. + +I have put a BASIC program alongside this file to illustrate these +computations. Since the function D I used polls each pair of points +to find out the repulsion between them, the running time for one +iteration is proportional to N^2. Thus I hardly recommend it for +values of N above a couple of hundred if you are using, say, a +personal computer. + +============================================================================== + +Q6. What if I want randomly to place points with a uniform distribution? + +This is perhaps the easiest way: randomly select the Cartesian +coordinates independently. Toss out any sets of coordinates which take +you outside the sphere (and, should you be so unlucky, any occurrences +of the origin). Scale all the remaining points to push them out to the sphere. + +Another easy way is to use the area-preserving parameterizations of +Q3. If you use the cartographer's polar projection (the third +parameterization in Q3), you need only select random points in the +disk with uniform distribution there, then map each of the selected +points onto the sphere. Of course, you'll never hit the North Pole +this way, but that event was supposed to occur with probability zero +anyway. (The simplest way to select random points in the disk is just +to pick the n coordinates separately, each randomly in [-1,1]. Then +simply discard the n-tuples not lying in the disk.) + +You can also use the trigonometric parameterizations of the sphere +(the cartographer's cylindrical projections): if you are placing +points randomly, the probability that you will have to consider one of +the points where the parameterization fails to be one-to-one, or +continuous, say, is zero. + +However, you need to exercise some care for spheres of dimension +greater than 1 since in that case, the trig parameterizations are not +area-preserving. For example, if you use the spherical coordinates +above to parameterize the 2-sphere, and select u and v randomly in +their respective intervals, you'll find too many points chosen near +the poles. This is because the sets of points with u fixed near ++-pi/2 are only _small_ circles; it is not appropriate to select these +u's as frequently. Indeed, the frequency with which a given u is +selected should be proportional to the circumference of that circle, +which is 2pi |cos(u)|. + +More generally, one can randomly select points in the n-sphere by randomly +selecting the n spherical parameters, but always remembering to select +each u_i with a frequency proportional to the (i-1)-dimensional +measure of the the set of points where u_i is a constant. Fortunately, +we know that this set is just an (i-1) sphere whose radius is cos(u_i) +and whose measure is then proportional to cos(u_i)^(i-1). + +Thus the problem of random point selection on the sphere reduces to this +question: how can one select values of a variable u from an interval +[a,b] so as to achieve a given frequency distribution f? One way is to +let C be the cumulative distribution function, C(u)=integral_a^u f(t)dt. +(This is the probability that the chosen value will be between a and u.) +For then if G is the inverse of this function, then we may select +x randomly with uniform distribution across [0,1] and let y=G(x). This is +a random variable whose distribution function looks like this: + Prob( h < y < h+e ) = Prob( h < G(x) < h+e ) + = Prob(C(h) < x < C(h+e) ) which is roughly + = Prob( C(h) < x < C(h)+C'(h)e ) +using differentials. Well, if x is selected with uniform distribution, +this probability is the length of that interval, namely C'(h)e = f(h).e +since C'=f. But to say that this original probability is roughly f(h)e +is precisely to say that the distribution function for the variable y is f. + +With the frequency distributions at hand, this is not difficult. The +first coordinate u1 is selected uniformly in [0,2 pi). The second +coordinate u2 is to have a probability density function cos(u_2) /2 +on (-pi/2, pi/2) [The extra factor of 1/2 is just a scaling needed to +ensure that the cumulative distribution C(u) ends with C(pi/2) = 1.0]. +Indeed, C(u) = (sin(u)+1)/2 has inverse G(w)= arcsin(2w-1), so one can +get points uniformly on the 2-sphere using the usual trig parameterization +with v selected uniformly in [0, 2pi) and u=arcsin(2w-1) where +w is selected uniformly in [0,1]. The Cartesian coordinates are then +(sqrt(4x-4x^2).cos(v), sqrt(4x-4x^2).sin(v), 2x-1). + +With a slight change of variables, this can be viewed as the equal-area +cylindrical projection in cartography: you can parameterize the surface +of the 2-sphere with coordinates x in [-1, 1] and v in [0,2 pi) +in such a way as to preserve area by mapping (x,v) to +(sqrt(1-x^2) cos(v), sqrt(1-x^2) sin(v), x). This is the projection +which carries points on the sphere (minus the poles) to points on a +cylinder wrapped around the equator by shining a light straight out from +the axis through the sphere to the cylinder. + +On the 3-sphere (and higher) the next variable is selected as u3=G(x3) +where x3 is selected uniformly in [0,1] and G is the inverse of +the function C(u) = (1/2)(u+pi/2+sin(u)cos(u)). The next variable +after that is selected similarly using C(u) = ( cos(u) - sin^3(u)/3 - 1/3). +And so on. + +Observe that if C can be computed, it is not necessary to be able to +solve for its inverse G explicitly. Instead, each of the computations of +y=G(x) may be written x=C(y), an equation to be solved for y when x +is known. This is easily handled with Newton's method. + +I believe a method of generating numbers randomly with a given distribution +is treated in Knuth's Art of Computer Programming. + +============================================================================== +Q7. Can this be generalized to higher-dimensional spheres? + +I have melded this into the previous questions -- anyone reading this +who can think of questions particular to higher dimensions should let me +know what ought to go in here. + +============================================================================== + +Q8. How is this related to collections of linear subspaces? + +I put this question here because someone asked me recently about how +to parameterize the set of planes and other linear subspaces of Euclidean +space. I can't quite remember the question, so I'll not make an +answer at this point. + +One comment to make is that points on the sphere (or, more precisely, +pairs of antipodal points) correspond to lines through the origin in +R^n. The planes in R^n correspond similarly to great circles (circles +of maximal circumference). If n=3, the planes can be parameterized in +a natural way by finding the line perpendicular to them. If you +trace these correspondences through, you'll see that there is a +natural one-to-one correspondence between great circles on the 2-sphere +and pairs of antipodal points. You can get the circle from the points +as the set of points equidistant from the pair; you can get the pair from +the circle as the only points equidistant from all elements of the circle. + +More generally, there is a correspondence between the k-dimensional +and the n-k dimensional subspaces of R^n. The family of all k-dimensional +subspaces in R^n (or n-k dimensional subspaces) is known as the +Grassmannian manifold; it is indeed a manifold and has dimension +k(n-k). When k=1 or k=n-1, this is also known as projective n-1 space; +there is a two-to-one mapping S^{n-1} --> RP^{n-1} which simply sends +a point to the line it's on (this map sends antipodal points of S^{n-1} +to the same element of RP^{n-1} of course. This map turns S^{n-1} +into a two-fold cover of RP^{n-1}; for n>2 this is the universal +covering of projective space.) + +This material is often treated in graduate topology courses, particularly +in a discussion on vector bundles in an algebraic topology course. +You may wish to consult Milnor's book "Characteristic Classes" +(Princeton Univ Press), which is however not elementary. + +============================================================================== + +Q9. Where can I get the coordinates of the vertices of the Platonic solids? + +There are 5 Platonic solids. An interesting but brief discussion (with +pictures) may be found, for example, in Gallian's "Contemporary +Abstract Algebra", D C Heath 1986. + +Solid Number of Vertices # Edges # Faces +Tetrahedron 4 6 4 +Cube 8 12 6 +Octahedron 6 12 8 +Dodecahedron 20 30 12 +Icosahedron 12 30 20 + +Notice the duality between certain pairs. Also note that V-E+F=2 in +all cases. This is deep, and may be thought of as the first fact in +homology theory. + +I will give coordinates of these points (and in indication of the rest +of the structure) in the most compact way I know; if you want the +coordinates so that, for example, the North Pole is one of the vertices, +you'll have to rotate the coordinates (the BASIC program I include has +in it an application of this procedure.) +I have scaled the coordinates so that they lie in the unit sphere, but +this is not always the prettiest form. I have however set it up so that +the coordinates of the dual pairs reflect this duality. + +Tetrahedron +The points are (0,0,1), (1/3)(2sqrt(2), 0, -1), and both points +(1/3)(-sqrt(2), +-sqrt(6), -1). All pairs are joined by an edge; all +triples (sets of three points) lie on a common face. + +Cube +The 8 points are (+-c, +-c, +-c) where c=1/sqrt(3) and the plus/minus signs +may be chosen independently. +The 12 edges join pairs of points which have 2 coordinates equal. +The 6 faces are quadruples of points for which one of the coordinates is + constant. + +Octahedron +The 6 vertices are of three types: (+-1, 0, 0) (0, +-1, 0) and (0, 0, +-1). +The 12 edges join all pairs except the three pairs of antipodal points. +The 8 faces consist of all triples which include one point of each type. + +Icosahedron +(using some information in a post by cjhenrich@delphi.com) +The 12 vertices are of three types: (+-a, +-b, 0), (0, +-a, +-b), and + (+-b, 0, +-a), where a=sqrt((5+sqrt(5))/10) and b=sqrt((5-sqrt(5))/10) +The 30 edges come in two flavors: there are 24 edges joining points of + different types such that their common non-zero coordinate is of + the same sign (e.g., (a, -b, 0) and (0, -a, b) have negative 2nd + coordinates). There are also 6 edges joining pairs of points of + the same type for which the sign in front of the a are equal + (e.g. (a, b, 0) and (a, -b, 0).) +The 20 faces are also of two general sorts: There are 8 triplets consisting + of one point of each type such that whenever two of the points + have a common non-zero coordinate, those coordinates are of the + same sign. Also, there are 12 triplets which contain two points of + the same type having the same sign 'X' in front of the a, and a + third point having 'X'.b in that coordinate (e.g., given the + two points at the end of the last paragraph, the third point on + a face with them is (b,0,+-a). ) + +Dodecahedron: This is dual to the icosahedron +I adapted these coordinates from a post by David Seal. +The 20 vertices include 8 vertices (+-c, +-c, +-c) (c=1/sqrt(3)) and + 12 of the forms (+-x, +-y, 0), (0, +-x, +-y), (+-y, 0, +-x) where + x=(sqrt(5)+1)/(2*sqrt(3)) and y=(sqrt(5)-1)/(2*sqrt(3)). +I will leave it to the reader to dualize the construction of the edges + and vertices of the icosahedron to get the edges and faces of + the dodecahedron. (That means I started it but it took too long. + The edges are the pairs of points of distance sqrt( (2/9)(sqrt(5)-1) ) + from each other.) + +============================================================================== + +Q10. How come this sounds like sphere packing? + +Very observant of you! Actually, there are a lot of questions whose +answers sound like an enumeration of the Platonic solids. Here is one +way to see a connection. + +Suppose you have a packing of Euclidean space with spheres of radius 1. +There is a way to get evenly spaced points on the unit sphere from it. + +For any r>0 let a_r be the number of spheres in the packing which are +contained in the ball of radius r centered at the origin. Note that +if V is the volume of the ball of radius 1 in R^n, then the ball of +radius r has volume V.r^n, so we have an upper bound of a_r <= r^n. +The limit of a_r to r^n as r increases without bound is the 'packing +density' d. Thus for large r we have a_r is roughly d r^n. In +particular, there are roughly d.n.r^(n-1) balls which fit in the +sphere of radius r+1 but not in the sphere of radius r. If we take the +centers of just these balls and scale them back just a bit, we will +get roughly d.n.r^(n-1) points on the sphere of radius r. Since the +sphere centers are of distance at least 2 from each other, the points +on the sphere will be almost 2 units apart, or further. + +Scaling this picture by a factor of r, we get about d.n.r^(n-1) points +on the unit sphere, with distances between them not much less than +2/r. This is not really better than we achieved with the method in Q4, +but it has the advantage that we can also place points on the sphere +corresponding to spheres in the packing which, for example, fit in +the ball of radius r+1.5, or r+2. Possibly these points, when projected +to the sphere of radius r, will lie close to other points already +placed, but then again, maybe not. + +It is not possible, so far as I know, to go backwards and create a +sphere packing from a distribution on a sphere. The difficulty is that +a sphere packing implies a distribution of arbitrarily many points on +the sphere, and gives these distributions in a coherent way, whereas +an individual distribution is done with just one fixed number of points. + +There is abundant literature and a fair amount of controversy about +sphere packings. Certainly the easiest sphere packings to understand +and use are the ones with great regularity: one places a sphere +centered at each point in a crystallographic lattice. An introduction +to these may be found, for example, in Durbin's "Modern algebra" +(Wiley, 1985). One looks at a subgroup of the group of translations in +R^n which is discrete (each group element is far from any other one); +the translates of a single point under this group form a lattice. The +group of symmetries of such a lattice is called a crystallographic +group. Note that there may be some symmetries which leave a point +fixed; the group of these is finite, and indeed there are only a few +possible groups: dropping, if necessary, to a subgroup of index two, +we can assume the group contains only rotations, at which point it +must be cyclic of order 1, 2, 3, 4, or 6; dihedral of order 4, 6, 8, +or 12, or the symmetry groups of the tetrahedron or octahedron. + +This topic comes up from time to time in the popular press, during +which an interview with the inimitable John Conway is de rigueur. +There is also some very interesting work on lattices in higher +dimensions, particularly dimensions which are a multiple of 8, such as +the important Leech lattice. The book by Conway and Sloane is full of +such tidbits. + +============================================================================== + diff --git a/reference/sphere.rand b/reference/sphere.rand new file mode 100644 index 0000000..bf56705 --- /dev/null +++ b/reference/sphere.rand @@ -0,0 +1,157 @@ +From: ags@seaman.cc.purdue.edu (Dave Seaman) +Newsgroups: sci.math.num-analysis +Subject: Re: N-dim spherical random number drawing +Date: 20 Sep 1996 13:04:58 -0500 + +In article <wmEgVDG00VUtMEhUwg@andrew.cmu.edu>, +Shing-Te Li <sl2x+@andrew.cmu.edu> wrote: +>Does anyone know a good algorithm that can draw a vector of random +>numbers uniformly from a N dimensional sphere? + +Here are four methods for generating uniformly-distributed points on a unit +sphere: + + (1) The normal-deviate method. Choose x, y, and z, each + normally-distributed with mean 0 and variance 1. (Actually, the + variance does not matter, provided it remains fixed.) Normalize + the result to obtain a uniform density on the unit sphere. + + This method also generalizes nicely to n dimensions. It works + because the vector chosen (before normalization) has a density + that depends only on the distance from the origin. The method is + mentioned in Knuth, _Seminumerical Algorithms_, 2nd. ed., pp. + 130-131. + + (2) The hypercube rejection method. Choose x, y, and z, uniformly + distributed on [-1,1]. Compute s = x^2 + y^2 + z^2. If s > 1, + reject the triplet and start over. Once you have an acceptable + vector, normalize it to obtain a point on the unit sphere. + + This method also generalizes to n dimensions, but as the dimension + increases the probability of rejection becomes high and many random + numbers are wasted. + + (3) The trig method. This method works only in 3-space, but it is + very fast. It depends on the slightly counterintuitive fact (see + proof below) that each of the three coordinates of a uniformly + distributed point on S^2 is uniformly distributed on [-1,1] (but + the three are not independent, obviously). Therefore, it + suffices to choose one axis (Z, say) and generate a uniformly + distributed value on that axis. This constrains the chosen point + to lie on a circle parallel to the X-Y plane, and the obvious + trig method may be used to obtain the remaining coordinates. + + (a) Choose z uniformly distributed in [-1,1]. + (b) Choose t uniformly distributed on [0, 2*pi). + (c) Let r = sqrt(1-z^2). + (d) Let x = r * cos(t). + (e) Let y = r * sin(t). + + (4) A variation of method (3) that doesn't use trig may be found in + Knuth, _loc. cit._. The method is equivalent to the following: + + (a) Choose u and v uniformly distributed on [-1,1]. + (b) Let s = u^2 + v^2. + (c) If s > 1, return to step (a). + (d) Let a = 2 * sqrt(1-s). + (e) The desired point is (a*u, a*v, 2*s-1). + + This method uses two-dimensional rejection, which gives a higher + probability of acceptance compared to algorithm (2) in three + dimensions. I group this with the trig method because both + depend on the fact that the projection on any axis is uniformly + distributed on [-1,1], and therefore both methods are limited to + use on S^2. + +I have found the trig method to be fastest in tests on an IBM RS/6000 +model 590, using the XLF 3.2 Fortran compiler, with the IMSL +subroutines DRNUN and DRNNOA generating the random numbers. The trig +routines were accelerated by using the MASS library, and sqrt +computations were performed by the hardware sqrt instruction available +on the 590. Timings for 1 million random points on S^2: + + (1) normal-deviate method 9.60 sec. + (2) 3-D rejection 8.84 sec. + (3) trig method 2.71 sec. + (4) polar with 2-D rejection 5.08 sec. + +The fact that algorithm (3) does not involve rejection turns out to be +a huge advantage because it becomes possible to generate all the points +in a single loop with no conditional statements, which optimizes very +effectively and more than offsets the cost of using trig functions. +The alternative with algorithms (2) and (4) is to generate the points +one at a time and test each one for acceptance before proceeding to the +next, which requires nested loops and many more subroutine calls to +generate the random numbers. Algorithm (1) does not explicitly use +rejection, but the IMSL subroutine DRNNOA uses rejection in generating +the normally-distributed numbers that are required. + +In higher dimensions, the normal-deviate method is preferred +because the probability of rejection in the hypercube method +grows very rapidly with the dimension. + +-------------------------------------------------------------------- + +Here is a proof of correctness for the fact that the z-coordinate is +uniformly distributed. The proof also works for the x- and y- +coordinates, but note that this works only in 3-space. + +The area of a surface of revolution in 3-space is given by + + A = 2 * pi * int_a^b f(x) * sqrt(1 + [f'(x}]^2) dx + +Consider a zone of a sphere of radius R. Since we are integrating in +the z direction, we have + + f(z) = sqrt(R^2 - z^2) + f'(z) = -z / sqrt(R^2-z^2) + + 1 + [f'(z)]^2 = r^2 / (R^2-z^2) + + A = 2 * pi * int_a^b sqrt(R^2-z^2) * R/sqrt(R^2-z^2) dz + + = 2 * pi * R int_a^b dz + + = 2 * pi * R * (b-a) + + = 2 * pi * R * h, + +where h = b-a is the vertical height of the zone. Notice how the integrand +reduces to a constant. The density is therefore uniform. + +-- +Dave Seaman dseaman@purdue.edu + ++++ stop the execution of Mumia Abu-Jamal ++++ + ++++ if you agree copy these lines to your sig ++++ +++++ see http://www.xs4all.nl/~tank/spg-l/sigaction.htm ++++ +============================================================================== + +From: George Marsaglia <geo@stat.fsu.edu> +Subject: Random points on a sphere. +Date: Tue, 15 Jun 1999 18:03:07 -0400 +Newsgroups: sci.math.num-analysis,sci.math,sci.stat.math + +Questions on methods for random sampling from the surface +of a sphere keep coming up, and seemingly get lost as +new generations of computer users encounter the problem. + +I described the methods that I had used for years in +"Sampling from the surface of a sphere", Ann Math Stat, v43, 1972, +recommending the two that seemed the fastest: + +1) Choose standard normal variates x1,x2,x3, +put R=1/sqrt(x1^2+x2^2+x3^2) and return the point +(x1*R,x2*R,x3*R). + +2) Generate u and v, uniform in [-1,1] until S=u^2+v^2 < 1, +then return (1-2*S,u*r,v*r), with r=2*sqrt(1-S). + +The second method is fast and easiest to program---unless +one already has a fast normal generator in his library. + +Method 1) seems by far the fastest (and easily applies to higher +dimensions) if my latest normal generator is used. +It produces normal variates at the rate of seven million +per second in a 300MHz Pc. + +George Marsaglia |