# Code and data for reproducibility
Here we provide the data required to reproduce the results of our study.
For all GISAID sequences and metadata used to build the Kallisto indices, we provide the lists of IDs. For copyright reasons, the FASTA sequences and metadata themselves must be downloaded directly from [GISAID](https://gisaid.org/) to reproduce our results.
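
Once the sequences are downloaded, they need to be restricted to the IDs listed here. The following is a minimal sketch of that filtering step, not the pipeline used in the study. It assumes that each ID list is a plain-text file with one `EPI_ISL_…` accession per line and that each FASTA header contains the accession somewhere in the header text; both are assumptions about the file layout, and the function names are hypothetical.

```python
import re
from typing import Iterable, Iterator, Set, Tuple

# Assumption: GISAID accessions look like "EPI_ISL_" followed by digits.
EPI_PATTERN = re.compile(r"EPI_ISL_\d+")


def read_ids(lines: Iterable[str]) -> Set[str]:
    """Collect accession IDs from an ID list, one per line, skipping blanks."""
    return {line.strip() for line in lines if line.strip()}


def parse_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from FASTA-formatted lines."""
    header = None
    chunks = []
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line.strip())
    if header is not None:
        yield header, "".join(chunks)


def filter_fasta(fasta_lines: Iterable[str], wanted_ids: Set[str]) -> Iterator[Tuple[str, str]]:
    """Keep only records whose header carries an accession in wanted_ids."""
    for header, sequence in parse_fasta(fasta_lines):
        match = EPI_PATTERN.search(header)
        if match and match.group(0) in wanted_ids:
            yield header, sequence
```

With hypothetical file names, usage could look like `filter_fasta(open("gisaid_download.fasta"), read_ids(open("ids.txt")))`, writing each returned record back out as `>header` followed by the sequence.
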
If you use this data, please cite:
[Aßmann, E., Agrawal, S., Orschler, L., Böttcher, S., Lackner, S., & Hölzer, M. (2023). Impact of reference design on estimating SARS-CoV-2 lineage abundances from wastewater sequencing data. _bioRxiv_, 2023-06.](https://www.biorxiv.org/content/10.1101/2023.06.02.543047v1)