compress compresses a placefile’s pqueries.
usage: compress [options] placefile
| --point-mass | Treat every pquery as a point mass concentrated on the highest-weight placement. |
| --pp | Use posterior probability for the weight. |
| -o | Specify the filename to write to. |
| --out-dir | Specify the directory to write files to. |
| --prefix | Specify a string to be prepended to filenames. |
| --cutoff | The cutoff parameter for mass compression |
| --discard-below | |
| In island clustering, ignore pquery locations with a mass less than the specified value. | |
| --mcl | Use MCL clustering instead of island clustering. |
| --inflation | If specified, pass this as the inflation value to MCL. |
A cutoff c is specified via a command line flag. The compress command merges pairs of pqueries that have KR distance between them less than c.
To compress the pqueries:
Each original pquery (=node) will get merged into one of the the selected pqueries. This will happen as follows. Maintain a set of unmerged pqueries, and a set of pairs (w, d(w)), where w is a selected pquery and d(w) is the degree of w in the graph.
Stop when the unmerged pquery set is empty.