


Category: Data

Description: MagnetDB is a large-scale, open dataset of BitTorrent metadata. Its torrents table holds nearly 30 million torrent entries, and its files table holds over 950 million file-level entries. For each torrent, MagnetDB records core attributes such as name, size, and discovery timestamps. For each file, it stores detailed metadata such as file path, size, video and audio codecs, and confidence scores for matches between video content and the non-commercial IMDb dataset, among other fields. With a unified schema spanning close to a billion rows, MagnetDB gives researchers, data scientists, and developers a resource for studying the supply side of the BitTorrent ecosystem, analyzing file distribution patterns, or building content aggregation tools. It is distributed as a single SQLite database (over 100 GB uncompressed), along with sample extracts for quick exploration.
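
Because the dataset ships as a SQLite file with a torrents table and a files table, it can be explored with Python's standard sqlite3 module. The sketch below is illustrative only: the table names come from the description, but every column name (id, name, size, discovered_at, torrent_id, path) and the file name magnetdb.sqlite are assumptions, so check the real schema (for example with `PRAGMA table_info(torrents)`) before adapting it. A tiny in-memory database stands in for the real file so the example runs without the download.

```python
import sqlite3

# Hypothetical schema: only the table names `torrents` and `files` come from
# the dataset description. Every column name here is an assumption.
DEMO_SCHEMA = """
CREATE TABLE torrents (
    id INTEGER PRIMARY KEY,
    name TEXT,
    size INTEGER,
    discovered_at TEXT
);
CREATE TABLE files (
    id INTEGER PRIMARY KEY,
    torrent_id INTEGER REFERENCES torrents(id),
    path TEXT,
    size INTEGER
);
"""


def open_readonly(path):
    """Open an existing SQLite file read-only so the dataset cannot be modified."""
    return sqlite3.connect(f"file:{path}?mode=ro", uri=True)


def build_demo_db():
    """Create a small in-memory stand-in that uses the assumed schema."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(DEMO_SCHEMA)
    conn.executemany(
        "INSERT INTO torrents VALUES (?, ?, ?, ?)",
        [(1, "example-a", 300, "2024-01-01"), (2, "example-b", 50, "2024-01-02")],
    )
    conn.executemany(
        "INSERT INTO files VALUES (?, ?, ?, ?)",
        [
            (1, 1, "a/one.bin", 100),
            (2, 1, "a/two.bin", 200),
            (3, 2, "b/only.bin", 50),
        ],
    )
    conn.commit()
    return conn


def files_per_torrent(conn, limit=10):
    """Return (name, file count, total file bytes) per torrent, largest count first."""
    query = """
        SELECT t.name,
               COUNT(f.id) AS n_files,
               COALESCE(SUM(f.size), 0) AS total_bytes
        FROM torrents AS t
        LEFT JOIN files AS f ON f.torrent_id = t.id
        GROUP BY t.id, t.name
        ORDER BY n_files DESC, t.name
        LIMIT ?
    """
    return conn.execute(query, (limit,)).fetchall()


if __name__ == "__main__":
    demo = build_demo_db()
    for name, n_files, total_bytes in files_per_torrent(demo):
        print(name, n_files, total_bytes)
```

Against the real database, `open_readonly("magnetdb.sqlite")` (hypothetical file name) would replace `build_demo_db()`. At more than 100 GB, a join like this is likely to be slow unless the join column is indexed, so trying queries on the sample extracts first is a reasonable starting point.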

License: CC-By Attribution 4.0 International

