compress compresses a placefile’s pqueries.
usage: compress [options] placefile
--point-mass | Treat every pquery as a point mass concentrated on the highest-weight placement. |
--pp | Use posterior probability for the weight. |
-o | Specify the filename to write to. |
--out-dir | Specify the directory to write files to. |
--prefix | Specify a string to be prepended to filenames. |
--cutoff | The cutoff parameter for mass compression |
--discard-below | |
In island clustering, ignore pquery locations with a mass less than the specified value. | |
--mcl | Use MCL clustering instead of island clustering. |
--inflation | If specified, pass this as the inflation value to MCL. |
A cutoff c is specified via a command line flag. The compress command merges pairs of pqueries that have KR distance between them less than c.
To compress the pqueries:
Each original pquery (=node) will get merged into one of the the selected pqueries. This will happen as follows. Maintain a set of unmerged pqueries, and a set of pairs (w, d(w)), where w is a selected pquery and d(w) is the degree of w in the graph.
Stop when the unmerged pquery set is empty.