initial commit

author: sanine <sanine.not@pm.me> 2022-01-24 19:34:28 -0600
committer: sanine <sanine.not@pm.me> 2022-01-24 19:34:28 -0600
commit: ec20b587362d76d6c48ecc1a5c1e65f1bb9293da (patch)
tree: e6739fc0925b007eee136a1b90a5d3d0abba8494
2 files changed, 918 insertions, 0 deletions
diff --git a/reference/sphere.faq b/reference/sphere.faq
new file mode 100644
index 0000000..cc60ca5
--- /dev/null
+++ b/reference/sphere.faq
@@ -0,0 +1,761 @@
+		TOPICS ON SPHERE DISTRIBUTIONS
+
+[Latest version of 1998/02/26 by Dave Rusin, rusin@math.niu.edu.
+This file is available by FTP (or gopher) at 
+	ftp://math.niu.edu/pub/papers/Rusin/known-math/95/sphere.faq
+but I recommend instead this URL which gives links to some related areas:
+	http://www.math.niu.edu/~rusin/known-math/index/spheres.html
+If you're reading this, send me your comments and suggestions.]
+
+I'm writing this FAQ because of the frequency with which topics related
+to this subject appear in the USENET newsgroups such as sci.math. This
+isn't really my main area of interest so some of the comments are
+pretty simple-minded; if you are a power user in need of guidance, you
+should look elsewhere for suggestions, while conversely if you can
+offer expert guidance I'd like to incorporate it here. Some of the
+material already here is lifted from other sources; whenever possible
+I try to assign the blame^H^H^H^H^H credit to someone else.
+
+The overall main topic is the question of how to place points on the 
+sphere, or how to divide up the sphere into regions, in a regular way.
+Along the way we'll talk about map-making, Platonic solids, and so on.
+The following questions will be addressed (other suggestions welcome):
+
+Q0. Notation
+Q1. What does it mean to place  N  points "evenly" on a sphere?
+Q2. Are there some values of  N  which are better than others?
+Q3. How can you describe locations on a sphere, anyway?
+Q4. Can you give a quick approximation to a good distribution? (*)
+Q5. How can we improve an approximate solution? (*)
+Q6. What if I want randomly to place points with a uniform distribution?
+Q7. Can this be generalized to higher-dimensional spheres?
+Q8. How is this related to collections of linear subspaces?
+Q9. Where can I get the coordinates of the vertices of the Platonic solids?
+Q10. How come this sounds like sphere packing?
+
+(*) Fetch the accompanying file sphere.bas for a BASIC program demonstrating
+these techniques.
+
+Other topics possibly to include in the future:
+	What's the formula for the volume of the n-ball? n-sphere?
+	What's this I hear about exotic differentiable structures on spheres?
+	Can you describe projective space and its relation to spheres?
+	What are sources for further information?
+If you need some information about these topics, let me know; that will be
+my clue that it's time to add something to the FAQ.
+
+At some point I may separate this into a 2-sphere FAQ and a hi-dimensional-
+sphere FAQ. If this seems important, let me know and I'll move it up my
+priority list. 
+
+dave
+
+==============================================================================
+	ANSWERS TO THE FREQUENTLY-ASKED QUESTIONS
+==============================================================================
+
+Q0. Notation 
+Just in case there is some confusion I'll informally make the definitions
+I'll need later.
+	EUCLIDEAN SPACE  (or n-space, or R^n) is the set of n-tuples of
+real numbers. We will use the ordinary metric (distance function). Note
+that we commonly speak of ourselves as living in 3-space.
+	The  SPHERE  of radius  r  in  R^n  is the set of points whose
+coordinates satisfy x1^2+...+xn^2=r^2; the unit sphere is the sphere of
+radius  1.  Note that I am explicitly assuming all spheres are centered
+at the origin. The interior of the sphere is called the  BALL  in  R^n;
+it is the set of points of distance less than  r  from the origin. (The
+ball and the sphere are disjoint; their union is sometimes called the
+n-DISK .) If not otherwise specified we are usually referring to the sphere or
+ball in  R^3.
+	The  DIMENSION  of a (reasonable) subset of  R^n  is the number of
+independent directions one can travel in the set. For example, in the 
+ordinary sphere motion at most points is a combination of movements
+East and North, so the sphere in R^3  is two-dimensional (not three).
+The ball in  R^n  is n-dimensional.
+	Note that the circle is the sphere (of dimension 1) in  R^2;
+the sphere in R^1 is two points (+1 and -1). We often write S^n for the
+n-dimensional sphere (in R^(n+1).)
+	A  LINE  is the set of multiples of a point or vector (other
+than the origin).  Note in particular that I am referring only to lines
+thru the origin.  A  PLANE  is the set of linear combinations of two
+non-collinear vectors.  (Don't call it a  SURFACE ; that's usually
+understood to be anything 2-dimensional, including planes, the
+2-sphere, paraboloids, and so on.)  Likewise we can take any k
+linearly independent vectors in R^n and look at all linear
+combinations of them; this set is called a  LINEAR SUBSPACE  of R^n and
+is of dimension k.  When k=n-1, the set is called a  HYPERPLANE.
+
+	The focus of this document is on S^2 but from time to time I
+consider greater generality when it is easy to do so.
+
+==============================================================================
+
+Q1. What does it mean to place  N  points "evenly" on a sphere?
+
+Well, what do you want it to mean? Several valid interpretations exist;
+they lead to slightly different distributions of points.
+
+If you want all distances between distinct points in the set to be equal,
+you can have at most  4  points. Indeed, it's easy to see any 3
+equally spaced points mark an equilateral triangle and any 4 must then
+mark a regular triangular pyramid. You can have a 5th point equidistant
+from the first  3 (glue two such pyramids face to face) but then the
+4th and 5th points are too far apart.
+
+But everyone agrees that the vertices of a regular polygon are equally
+spaced in S^1, so there must be another definition we have in
+mind. One can ask that every point have some property, such as being
+equidistant from its nearest neighbor. Such distributions exist -- for
+example, one can space the points equally along the equator of the
+sphere. Still, this is not likely to be what we have in mind when we
+ask for a uniform distribution of the points.
+
+One might ask that the points be arranged "symmetrically" on the sphere in
+some way; in Q3 we'll discuss how to do this, but true symmetry is available
+only for a few values of  N.
+
+It is more helpful to measure uniformity not point-by-point but by considering
+some aggregate measure of the dispersion. For example, we could let  D be
+the minimum distance between any pair of distinct points in the set. This
+provides a measurement of how good or poor a distribution we've chosen.
+For example, four points equally spaced on the equator have  D=sqrt(2)=1.41...
+whereas four points on the pyramid have a larger  D=(2/3)sqrt(6)=1.63...
+So we might ask for an arrangement on the sphere which maximizes  D.
+
+In general this statement of the problem guarantees a solution exists.
+Let  D  be any continuous function of  N  points on the sphere; then since
+the sphere is compact, so is the Cartesian product of  N  copies of it,
+so that  D  is a continuous function on a compact set, and so achieves a
+maximum and a minimum. If  D  is chosen intelligently, the values of the
+variables that lead to an extreme value of  D  will give an arrangement of
+points which is arguably "uniform".
+
+There are some drawbacks with this perspective. For one, the extreme
+values of D will be attained more than once. (Certainly if D depends
+symmetrically on the points, any permutation of the N variables will
+provide another occurrence of the extreme value. Indeed, it is
+reasonable to consider only functions defined on the symmetric product
+S^N(S) =(SxSx...xS)/Sym_n rather than on the Cartesian product
+(SxSx...xS) of N copies of the sphere S) If D is invariant under
+rotations of the sphere, which is reasonable, then in order to hope to
+have an isolated extreme configuration, one needs to hold one point
+fixed at, say, the North pole, and to allow another point to vary only
+along a meridian attached to the pole.
+
+Another obstacle is that there are several natural choices for  D, and an
+arrangement of points which is optimal for one such function need not be
+optimal for another. Here are some natural choices for D:
+	D1=min_i min_{j<>i} dist( Pi, Pj )   (the function considered above)
+	D2=(1/N) Sum_i min_{j<>i} dist( Pi, Pj ) ("average dist to a neighbor")
+	D3=Sum_i Sum_{i<>j} 1/dist( Pi, Pj ) ("potential energy")
+If an electron is placed at each of the points, D3 is the sum of the
+electrostatic potentials at these  N  electrons; they would tend to spread
+apart if possible so as to minimize  D3.
+
+Finally, there is the question of how one would compute an optimal 
+configuration once a decision is made of how optimality would be measured.
+If  D  is a differentiable function of the coordinates of the points, then
+the problem can be solved with Lagrange multipliers, but this is hopelessly
+cumbersome for any but the smallest values of  N  and the simplest
+functions  D. Moreover, some of the most natural functions  D  to take are
+not differentiable everywhere. We shall return to this in Q5.
+==============================================================================
+
+Q2. Are there some values of  N  which are better than others?
+
+You bet! As has been pointed out above, values of N <= 4 have very natural
+solutions. In addition, there are solutions for some  N  related to the
+Platonic solids. Here's why.
+
+Suppose we had an arrangement of  N  points with the following property:
+given any two points there is a rotation of the sphere which 
+	(1) carries the one to the other and in the process 
+	(2) carries each of the points in the distribution to some other point 
+		in the set (possibly itself). 
+These rotations generate a group (a subgroup of SO(3) ) of
+permutations of the N points; since the only rotation of the sphere
+which fixes two points not antipodal to each other is the identity
+(the trivial rotation), the group of all these rotations is finite
+(unless N=2).
+
+Well, as it happens, there are not many finite subgroups of SO(3).
+
+First, there are all the cyclic subgroups which are sets of rotations
+with a common axis, as well as the dihedral groups which adjoin one
+rotation about an axis perpendicular to this one. If this is the
+symmetry group of your arrangement of points, then all your points
+lie on one circle (two circles, if you have a dihedral group and a
+non-equatorial arrangement). 
+
+The remaining finite subgroups of SO(3) are the symmetry group of the
+pyramid (isomorphic to Alt(4), it has order 12), the symmetry group of
+the cube (it's the full symmetric group of order 24, permuting the 4
+diagonals) and the symmetry group of the dodecahedron (the symmetric
+group of order 120). The octahedron is dual to the cube and the
+icosahedron is dual to the dodecahedron, while the tetrahedron
+(pyramid) is self-dual.  (Here "dual" means vertices of one are
+midpoints of faces of the other, so dual polyhedra have the same
+symmetry group.)  (For a full analysis of the Platonic solids, see
+Coxeter, "Introduction to Geometry", Wiley (NY) 1969.)
+
+It's not quite true that a set of points permuted by one of these three
+groups has to be the set of vertices of a Platonic solid (as a glance
+at a soccer ball [non-US: football] will show). However, a firm
+insistence that there be sufficient rotations to carry each vertex
+to any other limits the number of points to 120.
+
+So there are only finitely many non-circular arrangements with this high 
+degree of symmetry. Of course, you could relax the condition that there be
+rotations preserving the set of points which permute the points
+transitively (any point can be carried to any other one). For example, you
+could begin with one of the Platonic solids and mark off, say, the
+midpoint of each edge as well as each of the vertices. (The former need
+to be scaled of course to make them lie on the sphere.) There is a large
+collection of figures that exhibit some partial symmetry. In general, the
+more symmetry you require, the fewer the number of values of  N  which
+you can expect to place so symmetrically.
+
+If you are working with spheres of a higher dimension, there are more
+finite groups of rotations; the orbits of a point under these groups will
+give sets of points which could reasonably be called "uniform". Indeed,
+given any set of points on a sphere (or any other metric space) you can
+define the Voronoi region of each point to be the set of elements in the
+sphere closer to that point than to any other. Then if the rotation group
+acts transitively on the points, it provides enough motions to show that
+all the Voronoi regions are congruent. A couple of theorems are relevant
+here, I guess: first, given any group  G  there is a group of rotations
+in  SO(n)  isomorphic to  G  as long as  n  is sufficiently large. Second,
+given any  n  there may be infinitely many finite subgroups in SO(n)
+(as the cyclic subgroups of SO(3) demonstrate) but there are only finitely
+many quotient groups  G / Z(G); that is, the  G  will have an (Abelian)
+center of bounded index.
+
+Departing from these geometric considerations, we will see in Q4 that
+some approximate constructions of uniform arrangements are a little
+easier for some N's than others, so if you have some freedom in the
+selection of N you can make your job a little easier.
+
+==============================================================================
+
+Q3. How can you describe locations on a sphere, anyway?
+
+Clearly any point on a sphere also sits in the ambient Euclidean space,
+and so may be specified by its ordinary Cartesian coordinates. But these
+coordinates are bound by the equation of the sphere, and it is sometimes
+preferable to use coordinates which are unbound.
+
+Since the sphere in  R^n is (n-1)-dimensional, it is natural to try to 
+parameterize your position with just  n-1 variables. This is well known
+on the ordinary 2-sphere, where we use latitude and longitude. Such
+parameterizations are possible, although there are some nettlesome points
+to keep in mind.
+
+Ideally, a parameterization of the n-sphere would be a function P
+defined on a subset of Euclidean space R^n and taking values in the
+n-sphere, having the following properties:
+	1) P is continuous (i.e. a continuous change in  R^n  should yield
+		a continuous change on the sphere. Equivalently,  P 
+		should be given by  (n+1)  continuous coordinate functions
+		whose squares sum to  1. )
+	2) P is one-to-one: a point on the sphere shouldn't show up more
+		than once during the parameterization.
+	3) P has a continuous inverse  F, that is, a coordinate function
+		which determines for each point in the sphere what values
+		of the parameters are needed to map to that point.
+		The existence of such an  F  implies  P  would be onto.
+	4) P (and/or  F)  preserves important geometric information like 
+		angles or areas (or the n-dimensional equivalents).
+	5) P (and/or F)  is differentiable.
+Unfortunately, for topological reasons there can be no function P
+meeting even just (1) (2) and (3). This is well-known to map-makers
+trying the case n=2: any map of the earth's surface has to rip the 
+surface somewhere.
+
+However, there are functions which meet some of these conditions; various ones
+are appropriate under different conditions.
+
+1. If you only want to parameterize _part_ of the sphere, you
+can get away with a simple projection: use  F(x1,...,x{n+1})=(x1,...,xn)
+and  P(x1,...,xn)=(x1,...,xn, sqrt(1-x1^2-...-xn^2) ). Use the negative
+square root to parameterize the bottom half of the sphere. If you need
+to have a parameterization valid a neighborhood of some point on the
+equator, have  F  drop out some other coordinate  (which  P  puts
+back in with a similar square root formula).
+
+2. You can get a correspondence P mapping onto all of the sphere
+except one point (say, the North pole "NP"); indeed you can define 
+P : R^n --> S-{NP} and an inverse map F going the other way which are
+continuous, differentiable, one-to-one, and onto. Do this with
+stereographic projection: place the plane thru the equator of the
+sphere and draw rays emanating from the North Pole and pointing
+downward some non-zero amount. Each such ray intersects both the
+sphere and the plane in one point, and all the points in S-{NP} and
+R^n are so met.  These rays then set up the one-to-one
+correspondence. The formulas are
+	P(x1,...,xn) = t.(x1,...,xn,0) + (1-t)(0,...,0,1)
+	F(x1,...,x{n+1})= r . (x1, ..., xn)
+(where (0,...,0,1) is the North Pole on the n-sphere in  R^{n+1}.) Here
+t is chosen to make the distance from the origin be  1 : t=2/(1+x1^2+...+xn^2).
+Also  r  is chosen to invert the first formula: r=1/(1-x{n+1})
+
+These formulas are the basis of the polar projection used by map-makers.
+
+3. If it is necessary to have a parameterization which preserves area
+(or its n-dimensional equivalent), this coordinate system  F  can be
+modified by adjusting the function  r=r(x{n+1}). 
+
+Here's how we decide what the right function  r  is. Consider the set of
+points in  S^n  with  x{n+1}=a. This is an (n-1) sphere with radius 
+h=sqrt(1-a^2). Then for small  e  the set of points in  S^n  with 
+x{n+1} lying between  a  and  a+e is a slightly bent version of the
+product of an (n-1)-sphere of radius  h  and an interval of length 
+e.sqrt[1+(dh/da)^2] = e/sqrt(1-a^2). The function  F  above carries it
+to another such product, but this time the radius is  z(a)=r(a)h(a) and
+the length of the interval is about e.z'(a). Since F changed the radius 
+by a factor  z(a)/h, the (n-1)-volume will change by a factor
+of  [z(a)/h]^(n-1). If we want to keep the  n-volume unchanged, we need
+the ratio of the lengths of the interval, roughly  z'(a) sqrt(1-a^2), to
+be the reciprocal of this, that is, we need   z'(a).z(a)^(n-1) =
+h^(n-1)/sqrt(1-a^2) = sqrt(1-a^2)^(n-2). Solving this differential
+equation shows we need 
+  r(a) = z(a)/h(a) = [1/sqrt(1-a^2)].[n.integral(1-a^2)^(n/2-1) da]^(1/n)
+(As z(-1)=0, the integral should be selected to vanish at a=-1.)
+
+So  r(x)=[1/sqrt(1-x^2)].[n. integral_{-1}^{x} (1-a^2)^(n/2 - 1) da]^(1/n).
+For  n=2, this gives  r(a)=sqrt(2/(1-a)), i.e. z(a)=sqrt(2*(1+a)). In
+particular, this will map the whole sphere to the disk of radius 2,
+whose area is then 4pi (the area of the sphere) as it should be.
+
+4. There are also trigonometric parameterizations.
+For example, it is easy to parameterize the 1-sphere (the circle) with a
+parameter  t  using  P(t) = (cos(t), sin(t)). Here if we really want P
+to be one-to-one we need to restrict to an interval such as  [0, 2 pi).
+This is then indeed one-to-one, onto, and continuous (even differentiable)
+and as a bonus preserves lengths. But there can be no continuous inverse
+function. F(x,y)=arctan(y/x) will do if branches of the arctan function 
+are chosen appropriately, but no set of choices can be made consistently
+in all quadrants of  S^1.
+
+Still, this parameterization is useful, and admits generalization. The
+function  
+	P(u,v) = (cos(u)cos(v), cos(u)sin(v), sin(u)) 
+is also differentiable, maps onto S^2, and is one-to-one on the set 
+(-pi/2,pi/2)x[0,2 pi). (The image of  P  on this set misses the poles, 
+however.) This  P  has an inverse in the sense we had on the circle: 
+(u,v)=F(x,y,z)=(arctan(z/sqrt(x^2+y^2)), arctan(y/x) ). 
+	This is the ordinary parameterization of the sphere in terms of 
+latitude (u) and longitude (v). By computing the Jacobian for the change of 
+variable to spherical coordinates, we see that this parameterization 
+changes areas by a factor of  |cos(u)|.
+
+This may be generalized to the n-sphere: use the function  P_n, where
+P_n(u1,...,un) = ( cos(un)*P_{n-1}(u1,...,u{n-1}) ,  sin(un) )
+(for each  un  in (-pi/2, pi/2), the parameterization traces out an
+(n-1)-sphere using P_(n-1). ); n-dimensional volume is reduced by a
+factor of  | cos(u2) . cos(u3)^2 . ... . cos(un)^(n-1) |.
+(I'll discuss another interpretation of this product in Q6.)
+
+These parameterizations form the basis of various cylindrical maps of
+the earth (rectangular maps excluding the poles in which the left and
+right edges represent the same points on earth). The Mercator projection
+is chosen to remove the distortion of angles; thus (locally) the
+Mercator projection gives an accurate representation of shapes. But the
+correction used is precisely the inverse of what is needed to correct
+the |cos(u)| change in area, so that the Mercator projection is wildly
+inaccurate at portraying areas (especially near the poles). I believe
+the 'Peters projection' is the name of the map which corrects areas (but
+which amplifies the distortions of angles, and thus of shapes).
+
+5. If the trigonometric expressions are too computation-intensive, 
+one can use a parameterization requiring only rational functions. Note first
+that the circle may be parameterized using a variation of stereographic
+projection:
+	P(t) = ( -2t/(1+t^2) , (1-t^2)/(1+t^2) )
+	F(x,y)= (y-1) /x
+Thus, we can replace all sine/cosine pairs with these two functions.
+For example, the 2-sphere may be parameterized:
+P(u,v)=( 4uv/[(1+u^2)(1+v^2)], -2u(1-v^2)/[(1+u^2)(1+v^2)], (1-u^2)/(1+u^2) )
+F(x,y,z)= ( (z-1)/sqrt(x^2+y^2), (y-1)/x )
+
+For further details, consult an advanced calculus text; look up the
+change-of-variables theorem, particularly spherical coordinates and
+the role of the Jacobian (derivative) in determining areas.
+[I'll add a reference to the mathematics of map-making as soon as I find one.]
+
+==============================================================================
+
+Q4. Can you give a quick approximation to a good distribution?
+
+Of course. Mark off  K  equal segments along a meridian using  K-1
+points plus the two endpoints. These segments have length pi/K. If you
+swing this around the sphere, you'll mark off  K-1  circles. Now, in
+spherical coordinates, these circles are the points with  
+ui=-pi/2 +i*(pi/K), where  i = 1, 2, ..., K-1. They have radius  cos(ui), 
+hence length 2pi*cos(ui). Thus you can divide them into about 
+[2pi*cos(ui)]/[pi/K] segments as long as the segments on the meridian.
+So take  Mi  to be an integer near 2*K*cos( -pi/2 + i*(pi/K)) =
+2*K*sin( i * (pi/K) ), and mark off points that have  u=ui and 
+v=j*(2*pi/Mi) for  j = 1, 2, ..., Mi. This will give you
+2+Sum{i=1,...,K-1} M_i points on the sphere.
+
+To within an error less than  K-1,  this is the same as 
+2+Sum 2*K*sin( i* pi/K ) = 2 + 2*K*cot( pi / 2K ). For large values of  K,
+this is about (4/pi) K^2.  (With an area of 4pi on the whole sphere, 
+this leaves an area of (pi/K)^2 around each point, which is not
+surprising since initially the points are about pi/K apart on a 
+roughly square grid).
+
+If you have the luxury of placing _approximately_ N points on the
+sphere, begin with  K  an integer roughly equal to  sqrt((pi/4)N), and
+compute the number of points which would be placed with the method
+above; you should get about  N  points. Here are the values I get
+for the first few  K  using round() to convert to integers when necessary:
+for K=1, N=2; for K=2, N=6; then for subsequent  K (thru K=22), I get N=
+12,22,34,46,82,106,128,156,184,214,248,288,328,368,412,460,510,562,616,...
+
+Notice that the first layer around the north pole has about 2Ksin(pi/K)
+points. For  K large, this is about  2*pi = 6.28, so we would take M1=6,
+and arrange the first circle of points in the "kissing pennies" pattern.
+
+This arrangement of points on the sphere has an obvious regularity,
+but the points are not spaced as far apart as possible by any reasonable
+measure -- for example, roughly sqrt(K/5) of the rows nearest the
+equator will have about the same number of points in them, roughly in
+a square-looking grid; clearly a honeycomb distribution will place the
+same number of points with a greater spacing. This leads to question Q5.
+
+I have included as a separate file a BASIC program which carries out this
+distribution. In view of the previous paragraph, this code can be
+improved when  N  is large, but I'm not really sure of an optimal way to 
+do so, so I won't.
+
+==============================================================================
+
+Q5. How can we improve an approximate solution?
+
+If the approximate solution in Q4 is insufficient, you'll have to decide why.
+This returns you to the discussion on Q1. Let me assume that there is some
+function  D  of the positions of the points which you wish to minimize
+or maximize. (Replacing   D  by  -D  as necessary we may assume D is to be
+maximized.)
+
+The method outlined in  Q4  should give a low value of  D. If  D  is
+differentiable, you can use the method of gradients: you adjust the positions
+of the points in the direction of the gradient. Professionals in
+numerical analysis should scoff at this naive approach -- for one thing,
+this method assumes that the critical point is an isolated minimum; also,
+it is not clear just how far in the direction of the gradient to 
+proceed so as not to overshoot the region of convergence; moreover, once
+in the region of convergence, the rate of convergence can be quite slow.
+However, in practice I have found that if you have a moderate number of
+points to place and you are not too demanding in your search for an
+absolute minimum, this method works quickly enough if you begin with a
+moderately uniform distribution of points. (I would appreciate learning
+of improvements from numerical analysts.)
+
+Here is what the method of gradients means in more detail.  If you
+have parameterized the sphere and expressed D as a function of N sets
+of the parameters, compute the partial derivatives of D with respect
+to all these variables, and then evaluate at the present values of the
+parameters. Add to each of the parameters a small multiple of this
+partial derivative.
+
+If instead you have calculated D as a function of the Cartesian
+coordinates of the N points, you need to take the portion of the
+gradient which points in a direction tangent to the sphere. In
+practice, this can be accomplished by adjusting all coordinates of the
+points as in the preceding paragraph, then rescaling each point's
+coordinates to ensure the vector is of length one.
+
+There is the possibility of creep to be concerned with. That is, it is
+possible that with each iteration the function   D   is not improved;
+rather, the points have simply all rotated somewhat around the sphere.
+Under the assumption that  D  is invariant under rotations, there is no
+need to allow this kind of motion. One can negate this possibility by
+rotating the sphere after each iteration so that one point (say the
+North pole) is returned to its previous position and another point
+is returned to the meridian it was on previously.
+
+I have put a BASIC program alongside this file to illustrate these
+computations. Since the function  D  I used polls each pair of points
+to find out the repulsion between them, the running time for one
+iteration is proportional to  N^2. Thus I hardly recommend it for
+values of  N  above a couple of hundred if you are using, say, a
+personal computer.
+
+==============================================================================
+
+Q6. What if I want randomly to place points with a uniform distribution?
+
+This is perhaps the easiest way: randomly select the Cartesian
+coordinates independently. Toss out any sets of coordinates which take
+you outside the sphere (and, should you be so unlucky, any occurrences
+of the origin). Scale all the remaining points to push them out to the sphere.
+
+Another easy way is to use the area-preserving parameterizations of
+Q3. If you use the cartographer's polar projection (the third
+parameterization in Q3), you need only select random points in the
+disk with uniform distribution there, then map each of the selected
+points onto the sphere. Of course, you'll never hit the North Pole
+this way, but that event was supposed to occur with probability zero
+anyway. (The simplest way to select random points in the disk is just
+to pick the n coordinates separately, each randomly in [-1,1]. Then
+simply discard the n-tuples not lying in the disk.)
+
+You can also use the trigonometric parameterizations of the sphere
+(the cartographer's cylindrical projections): if you are placing
+points randomly, the probability that you will have to consider one of
+the points where the parameterization fails to be one-to-one, or
+continuous, say, is zero.
+
+However, you need to exercise some care for spheres of dimension
+greater than 1 since in that case, the trig parameterizations are not
+area-preserving.  For example, if you use the spherical coordinates
+above to parameterize the 2-sphere, and select u and v randomly in
+their respective intervals, you'll find too many points chosen near
+the poles. This is because the sets of points with u fixed near 
++-pi/2 are only _small_ circles; it is not appropriate to select these
+u's as frequently.  Indeed, the frequency with which a given u is
+selected should be proportional to the circumference of that circle,
+which is 2pi |cos(u)|.
+
+More generally, one can randomly select points in the n-sphere by randomly
+selecting the n spherical parameters, but always remembering to select
+each  u_i  with a frequency proportional to the  (i-1)-dimensional
+measure of the the set of points where  u_i  is a constant. Fortunately,
+we know that this set is just an (i-1) sphere whose radius is cos(u_i) 
+and whose measure is then proportional to  cos(u_i)^(i-1).
+
+Thus the problem of random point selection on the sphere reduces to this
+question: how can one select values of a variable   u  from an interval
+[a,b] so as to achieve a given frequency distribution  f? One way is to
+let  C  be the cumulative distribution function, C(u)=integral_a^u f(t)dt.
+(This is the probability that the chosen value will be between  a  and  u.)
+For then if  G  is the inverse of this function, then we may select
+x randomly with uniform distribution across [0,1] and let  y=G(x). This is
+a random variable whose distribution function looks like this:
+         Prob( h < y < h+e ) = Prob( h < G(x) < h+e )
+                             = Prob(C(h) < x < C(h+e) ) which is roughly
+                             = Prob( C(h) < x < C(h)+C'(h)e )
+using differentials. Well, if  x  is selected with uniform distribution,
+this probability is the length of that interval, namely  C'(h)e = f(h).e
+since  C'=f. But to say that this original probability is roughly  f(h)e
+is precisely to say that the distribution function for the variable y  is  f.
+
+With the frequency distributions at hand, this is not difficult. The
+first coordinate  u1  is selected uniformly in  [0,2 pi).  The second
+coordinate  u2  is to have a probability density function  cos(u_2) /2
+on (-pi/2, pi/2)  [The extra factor of  1/2  is just a scaling needed to
+ensure that the cumulative distribution  C(u)  ends with  C(pi/2) = 1.0].
+Indeed,  C(u) = (sin(u)+1)/2 has inverse  G(w)= arcsin(2w-1), so one can
+get points uniformly on the 2-sphere using the usual trig parameterization
+with  v  selected uniformly in  [0, 2pi)  and  u=arcsin(2w-1) where
+w  is selected uniformly in  [0,1]. The Cartesian coordinates are then
+(sqrt(4x-4x^2).cos(v), sqrt(4x-4x^2).sin(v),  2x-1).
+
+With a slight change of variables, this can be viewed as the equal-area
+cylindrical projection in cartography: you can parameterize the surface
+of the 2-sphere with coordinates  x  in [-1, 1]  and  v  in [0,2 pi) 
+in such a way as to preserve area by mapping  (x,v) to
+(sqrt(1-x^2) cos(v), sqrt(1-x^2) sin(v), x). This is the projection 
+which carries points on the sphere (minus the poles) to points on a 
+cylinder wrapped around the equator by shining a light straight out from
+the axis through the sphere to the cylinder.
+
+On the 3-sphere (and higher) the next variable is selected as  u3=G(x3)
+where  x3  is selected uniformly in [0,1] and  G  is the inverse of
+the function  C(u) = (1/2)(u+pi/2+sin(u)cos(u)). The next variable
+after that is selected similarly using  C(u) = ( cos(u) - sin^3(u)/3 - 1/3).
+And so on.
+
+Observe that if  C  can be computed, it is not necessary to be able to
+solve for its inverse  G  explicitly. Instead, each of the computations of
+y=G(x)  may be written  x=C(y), an equation to be solved for  y  when  x
+is known. This is easily handled with Newton's method.
+
+I believe a method of generating numbers randomly with a given distribution
+is treated in Knuth's Art of Computer Programming.
+
+==============================================================================
+Q7. Can this be generalized to higher-dimensional spheres?
+
+I have melded this into the previous questions -- anyone reading this
+who can think of questions particular to higher dimensions should let me
+know what ought to go in here.
+
+==============================================================================
+
+Q8. How is this related to collections of linear subspaces?
+
+I put this question here because someone asked me recently about how
+to parameterize the set of planes and other linear subspaces of Euclidean
+space. I can't quite remember the question, so I'll not make an
+answer at this point.
+
+One comment to make is that points on the sphere (or, more precisely,
+pairs of antipodal points) correspond to lines through the origin in 
+R^n. The planes in  R^n  correspond similarly to great circles (circles
+of maximal circumference). If  n=3,  the planes can be parameterized in
+a natural way by finding the line perpendicular to them. If you
+trace these correspondences through, you'll see that there is a 
+natural one-to-one correspondence between great circles on the 2-sphere
+and pairs of antipodal points. You can get the circle from the points
+as the set of points equidistant from the pair; you can get the pair from
+the circle as the only points equidistant from all elements of the circle.
+
+More generally, there is a correspondence between the  k-dimensional
+and the n-k dimensional subspaces of  R^n. The family of all k-dimensional
+subspaces in  R^n  (or  n-k dimensional subspaces) is known as the
+Grassmannian manifold; it is indeed a manifold and has dimension
+k(n-k). When  k=1 or k=n-1, this is also known as projective n-1 space;
+there is a two-to-one mapping  S^{n-1} --> RP^{n-1}  which simply sends
+a point to the line it's on (this map sends antipodal points of  S^{n-1}
+to the same element of  RP^{n-1} of course. This map turns  S^{n-1}
+into a two-fold cover of  RP^{n-1}; for  n>2  this is the universal
+covering of projective space.)
+
+This material is often treated in graduate topology courses, particularly
+in a discussion on vector bundles in an algebraic topology course.
+You may wish to consult Milnor's book "Characteristic Classes"
+(Princeton Univ Press), which is however not elementary.
+
+==============================================================================
+
+Q9. Where can I get the coordinates of the vertices of the Platonic solids?
+
+There are 5 Platonic solids. An interesting but brief discussion (with
+pictures) may be found, for example, in Gallian's  "Contemporary
+Abstract Algebra",  D C Heath 1986.
+
+Solid		Number of Vertices	# Edges		# Faces
+Tetrahedron		4		6		4
+Cube			8		12		6
+Octahedron		6		12		8
+Dodecahedron		20		30		12
+Icosahedron		12		30		20
+
+Notice the duality between certain pairs. Also note that V-E+F=2 in
+all cases. This is deep, and may be thought of as the first fact in
+homology theory.
+
+I will give coordinates of these points (and in indication of the rest
+of the structure) in the most compact way I know; if you want the
+coordinates so that, for example, the North Pole is one of the vertices,
+you'll have to rotate the coordinates (the BASIC program I include has
+in it an application of this procedure.)
+I have scaled the coordinates so that they lie in the unit sphere, but
+this is not always the prettiest form. I have however set it up so that
+the coordinates of the dual pairs reflect this duality.
+
+Tetrahedron
+The points are (0,0,1), (1/3)(2sqrt(2), 0, -1), and both points
+(1/3)(-sqrt(2), +-sqrt(6), -1). All pairs are joined by an edge; all
+triples (sets of three points) lie on a common face.
+
+Cube
+The 8 points are (+-c, +-c, +-c) where  c=1/sqrt(3) and the plus/minus signs
+may be chosen independently. 
+The 12 edges join pairs of points which have 2 coordinates equal.
+The 6 faces are quadruples of points for which one of the coordinates is 
+	constant.
+
+Octahedron
+The 6 vertices are of three types: (+-1, 0, 0) (0, +-1, 0) and (0, 0, +-1).
+The 12 edges join all pairs except the three pairs of antipodal points.
+The 8 faces consist of all triples which include one point of each type.
+
+Icosahedron
+(using some information in a post by cjhenrich@delphi.com)
+The 12 vertices are of three types: (+-a, +-b, 0), (0, +-a, +-b), and
+	(+-b, 0, +-a), where a=sqrt((5+sqrt(5))/10) and b=sqrt((5-sqrt(5))/10)
+The 30 edges come in two flavors: there are 24 edges joining points of
+	different types such that their common non-zero coordinate is of
+	the same sign (e.g., (a, -b, 0) and (0, -a, b) have negative 2nd
+	coordinates). There are also 6 edges joining pairs of points of
+	the same type for which the sign in front of the  a  are equal
+	(e.g.  (a, b, 0)  and  (a, -b, 0).)
+The 20 faces are also of two general sorts: There are 8 triplets consisting
+	of one point of each type such that whenever two of the points
+	have a common non-zero coordinate, those coordinates are of the	
+	same sign. Also, there are 12 triplets which contain two points of
+	the same type having the same sign 'X' in front of the  a,  and a
+	third point having   'X'.b  in that coordinate (e.g., given the
+	two points at the end of the last paragraph, the third point on
+	a face with them is  (b,0,+-a). )
+
+Dodecahedron: This is dual to the icosahedron
+I adapted these coordinates from a post by David Seal.
+The 20 vertices include 8 vertices (+-c, +-c, +-c)  (c=1/sqrt(3)) and
+	12 of the forms (+-x, +-y, 0), (0, +-x, +-y), (+-y, 0, +-x) where
+	x=(sqrt(5)+1)/(2*sqrt(3)) and y=(sqrt(5)-1)/(2*sqrt(3)).
+I will leave it to the reader to dualize the construction of the edges
+	and vertices of the icosahedron to get the edges and faces of	
+	the dodecahedron. (That means I started it but it took too long.
+	The edges are the pairs of points of distance sqrt( (2/9)(sqrt(5)-1) )
+	from each other.)
+
+==============================================================================
+
+Q10. How come this sounds like sphere packing?
+
+Very observant of you! Actually, there are a lot of questions whose 
+answers sound like an enumeration of the Platonic solids. Here is one
+way to see a connection.
+
+Suppose you have a packing of Euclidean space with spheres of radius  1.
+There is a way to get evenly spaced points on the unit sphere from it.
+
+For any r>0 let a_r be the number of spheres in the packing which are
+contained in the ball of radius r centered at the origin.  Note that
+if V is the volume of the ball of radius 1 in R^n, then the ball of
+radius r has volume V.r^n, so we have an upper bound of a_r <= r^n.
+The limit of a_r to r^n as r increases without bound is the 'packing
+density' d. Thus for large r we have a_r is roughly d r^n. In
+particular, there are roughly d.n.r^(n-1) balls which fit in the
+sphere of radius r+1 but not in the sphere of radius r. If we take the
+centers of just these balls and scale them back just a bit, we will
+get roughly d.n.r^(n-1) points on the sphere of radius r.  Since the
+sphere centers are of distance at least 2 from each other, the points
+on the sphere will be almost 2 units apart, or further.
+
+Scaling this picture by a factor of r,  we get about  d.n.r^(n-1) points
+on the unit sphere, with distances between them not much less than
+2/r. This is not really better than we achieved with the method in Q4,
+but it has the advantage that we can also place points on the sphere 
+corresponding to spheres in the packing which, for example, fit in
+the ball of radius  r+1.5, or r+2. Possibly these points, when projected
+to the sphere of radius  r, will lie close to other points already
+placed, but then again, maybe not.
+
+It is not possible, so far as I know, to go backwards and create a
+sphere packing from a distribution on a sphere. The difficulty is that
+a sphere packing implies a distribution of arbitrarily many points on
+the sphere, and gives these distributions in a coherent way, whereas
+an individual distribution is done with just one fixed number of points.
+
+There is abundant literature and a fair amount of controversy about
+sphere packings. Certainly the easiest sphere packings to understand
+and use are the ones with great regularity: one places a sphere
+centered at each point in a crystallographic lattice. An introduction
+to these may be found, for example, in Durbin's "Modern algebra"
+(Wiley, 1985). One looks at a subgroup of the group of translations in
+R^n which is discrete (each group element is far from any other one);
+the translates of a single point under this group form a lattice. The
+group of symmetries of such a lattice is called a crystallographic
+group. Note that there may be some symmetries which leave a point
+fixed; the group of these is finite, and indeed there are only a few
+possible groups: dropping, if necessary, to a subgroup of index two,
+we can assume the group contains only rotations, at which point it
+must be cyclic of order 1, 2, 3, 4, or 6; dihedral of order 4, 6, 8,
+or 12, or the symmetry groups of the tetrahedron or octahedron.
+
+This topic comes up from time to time in the popular press, during
+which an interview with the inimitable John Conway is de rigueur.
+There is also some very interesting work on lattices in higher
+dimensions, particularly dimensions which are a multiple of 8, such as
+the important Leech lattice. The book by Conway and Sloane is full of
+such tidbits.
+
+==============================================================================
+
diff --git a/reference/sphere.rand b/reference/sphere.rand
new file mode 100644
index 0000000..bf56705
--- /dev/null
+++ b/reference/sphere.rand
@@ -0,0 +1,157 @@
+From: ags@seaman.cc.purdue.edu (Dave Seaman)
+Newsgroups: sci.math.num-analysis
+Subject: Re: N-dim spherical random number drawing
+Date: 20 Sep 1996 13:04:58 -0500
+
+In article <wmEgVDG00VUtMEhUwg@andrew.cmu.edu>,
+Shing-Te Li  <sl2x+@andrew.cmu.edu> wrote:
+>Does anyone know a good algorithm that can draw a vector of random
+>numbers uniformly from a N dimensional sphere?
+
+Here are four methods for generating uniformly-distributed points on a unit
+sphere:
+
+  (1) The normal-deviate method.  Choose x, y, and z, each
+      normally-distributed with mean 0 and variance 1.  (Actually, the
+      variance does not matter, provided it remains fixed.)  Normalize
+      the result to obtain a uniform density on the unit sphere.
+
+      This method also generalizes nicely to n dimensions.  It works
+      because the vector chosen (before normalization) has a density
+      that depends only on the distance from the origin.  The method is
+      mentioned in Knuth, _Seminumerical Algorithms_, 2nd. ed., pp.
+      130-131.
+
+  (2) The hypercube rejection method.  Choose x, y, and z, uniformly
+      distributed on [-1,1].  Compute s = x^2 + y^2 + z^2.  If s > 1,
+      reject the triplet and start over.  Once you have an acceptable
+      vector, normalize it to obtain a point on the unit sphere.
+
+      This method also generalizes to n dimensions, but as the dimension
+      increases the probability of rejection becomes high and many random
+      numbers are wasted.
+
+  (3) The trig method.  This method works only in 3-space, but it is
+      very fast.  It depends on the slightly counterintuitive fact (see
+      proof below) that each of the three coordinates of a uniformly
+      distributed point on S^2 is uniformly distributed on [-1,1] (but
+      the three are not independent, obviously).  Therefore, it
+      suffices to choose one axis (Z, say) and generate a uniformly
+      distributed value on that axis.  This constrains the chosen point
+      to lie on a circle parallel to the X-Y plane, and the obvious
+      trig method may be used to obtain the remaining coordinates.
+
+	(a) Choose z uniformly distributed in [-1,1].
+	(b) Choose t uniformly distributed on [0, 2*pi).
+	(c) Let r = sqrt(1-z^2).
+	(d) Let x = r * cos(t).
+	(e) Let y = r * sin(t).
+
+  (4) A variation of method (3) that doesn't use trig may be found in
+      Knuth, _loc. cit._.  The method is equivalent to the following:
+
+	(a) Choose u and v uniformly distributed on [-1,1].
+	(b) Let s = u^2 + v^2.
+	(c) If s > 1, return to step (a).
+	(d) Let a = 2 * sqrt(1-s).
+	(e) The desired point is (a*u, a*v, 2*s-1).
+      
+      This method uses two-dimensional rejection, which gives a higher
+      probability of acceptance compared to algorithm (2) in three
+      dimensions.  I group this with the trig method because both
+      depend on the fact that the projection on any axis is uniformly
+      distributed on [-1,1], and therefore both methods are limited to
+      use on S^2.
+
+I have found the trig method to be fastest in tests on an IBM RS/6000
+model 590, using the XLF 3.2 Fortran compiler, with the IMSL
+subroutines DRNUN and DRNNOA generating the random numbers.  The trig
+routines were accelerated by using the MASS library, and sqrt
+computations were performed by the hardware sqrt instruction available
+on the 590.  Timings for 1 million random points on S^2:
+
+	(1) normal-deviate method                   9.60 sec.  
+	(2) 3-D rejection                           8.84 sec.  
+	(3) trig method                             2.71 sec.  
+	(4) polar with 2-D rejection                5.08 sec.
+
+The fact that algorithm (3) does not involve rejection turns out to be
+a huge advantage because it becomes possible to generate all the points
+in a single loop with no conditional statements, which optimizes very
+effectively and more than offsets the cost of using trig functions.
+The alternative with algorithms (2) and (4) is to generate the points
+one at a time and test each one for acceptance before proceeding to the
+next, which requires nested loops and many more subroutine calls to
+generate the random numbers.  Algorithm (1) does not explicitly use
+rejection, but the IMSL subroutine DRNNOA uses rejection in generating
+the normally-distributed numbers that are required.
+
+In higher dimensions, the normal-deviate method is preferred
+because the probability of rejection in the hypercube method
+grows very rapidly with the dimension.
+
+--------------------------------------------------------------------
+
+Here is a proof of correctness for the fact that the z-coordinate is
+uniformly distributed.  The proof also works for the x- and y-
+coordinates, but note that this works only in 3-space.
+
+The area of a surface of revolution in 3-space is given by
+
+	A = 2 * pi * int_a^b f(x) * sqrt(1 + [f'(x}]^2) dx
+
+Consider a zone of a sphere of radius R.  Since we are integrating in
+the z direction, we have
+
+	f(z) = sqrt(R^2 - z^2)
+	f'(z) = -z / sqrt(R^2-z^2)
+
+	1 + [f'(z)]^2 = r^2 / (R^2-z^2)
+
+	A = 2 * pi * int_a^b sqrt(R^2-z^2) * R/sqrt(R^2-z^2) dz
+
+	  = 2 * pi * R int_a^b dz
+
+	  = 2 * pi * R * (b-a)
+
+	  = 2 * pi * R * h, 
+	  
+where h = b-a is the vertical height of the zone.  Notice how the integrand
+reduces to a constant.  The density is therefore uniform.
+
+-- 
+Dave Seaman			dseaman@purdue.edu
+      ++++ stop the execution of Mumia Abu-Jamal ++++
+    ++++ if you agree copy these lines to your sig ++++
+++++ see http://www.xs4all.nl/~tank/spg-l/sigaction.htm ++++
+==============================================================================
+
+From: George Marsaglia <geo@stat.fsu.edu>
+Subject: Random points on a sphere.
+Date: Tue, 15 Jun 1999 18:03:07 -0400
+Newsgroups: sci.math.num-analysis,sci.math,sci.stat.math
+
+Questions on methods for random sampling from the surface
+of a sphere keep coming up, and seemingly get lost as
+new generations of computer users encounter the problem.
+
+I described the methods that I had used for years in
+"Sampling from the surface of a sphere", Ann Math Stat, v43, 1972,
+recommending the two that seemed the fastest:
+
+1) Choose standard normal variates x1,x2,x3,
+put R=1/sqrt(x1^2+x2^2+x3^2) and return the point
+(x1*R,x2*R,x3*R).
+
+2) Generate u and v, uniform in [-1,1]  until S=u^2+v^2 < 1,
+then return (1-2*S,u*r,v*r), with r=2*sqrt(1-S).
+
+The second method is fast and easiest to program---unless
+one already has a fast normal generator in his library.
+
+Method 1) seems by far the fastest (and easily applies to higher
+dimensions) if my latest normal generator is used.
+It produces normal variates at the rate of seven million
+per second in a 300MHz Pc.
+
+George Marsaglia
author	sanine <sanine.not@pm.me>	2022-01-24 19:34:28 -0600
committer	sanine <sanine.not@pm.me>	2022-01-24 19:34:28 -0600
commit	ec20b587362d76d6c48ecc1a5c1e65f1bb9293da (patch)
tree	e6739fc0925b007eee136a1b90a5d3d0abba8494