Main content

Home

Menu

Loading wiki pages...

View
Wiki Version:
# sketchlib To use these indices you will need to install the sketchlib software, which is available from https://github.com/bacpop/sketchlib.rust. You must have the [rust toolchain](https://www.rust-lang.org/tools/install) installed, clone the repository, then run `cargo install --path .`. Use `sketchlib -h` to see the help or e.g. `sketchlib dist -h` for explanation of subcommands. The distributed index is sketch size 1024 with k=17. You can create different ranges using the sketch subcommand. To calculate distances of a subset of the data, use a command such as: ``` sketchlib dist -v -k 17 --subset Haemophilus_influenzae.txt --ani atb_sketchlib_v020 --threads 4 > dists.txt ``` Where the --subset file contains the list of samples you want to include. Removing --ani will calculate Jaccard distances. To query new samples against the index, first sketch it: ``` sketchlib sketch -v -o query -k 17 -f queries.tsv -s 1000 ``` (where queries.tsv contains query samples with name and file location, tab separated) Then query it ``` sketchlib dist -v -k 17 atb_sketchlib_v020 query ```
OSF does not support the use of Internet Explorer. For optimal performance, please switch to another browser.
Accept
This website relies on cookies to help provide a better user experience. By clicking Accept or continuing to use the site, you agree. For more information, see our Privacy Policy and information on cookie use.
Accept
×

Start managing your projects on the OSF today.

Free and easy to use, the Open Science Framework supports the entire research lifecycle: planning, execution, reporting, archiving, and discovery.