a large Dataset of synchronised Audio, LyrIcs and vocal notes
Cross-referenced across 55 tracked directories
#3826
Popularity Rank
1 / 55
Listed In
Emerging
Adoption Stage
6/8/2018
Created
380
GitHub Stars
Score: 100/100
0 dependency vulnerabilities found
Run an AI-powered security scan to analyze this package's source code for vulnerabilities, prompt injection vectors, data exfiltration risks, and behavior mismatches.
Scans fetch actual source code from the GitHub repository, not just the README.
General Corpus of Contemporary Brazilian Portuguese with provenance and typology information - Corpus Geral do Português Brasileiro Contemporâneo
...moreexploring 12 million of the 2.3 billion images used to train Stable Diffusion's image generator
a foundational dataset by Meta for research on video learning and multimodal perception [Dataset Download](https://ego-exo4d-data.org/)
...morean open dataset with 30 trillion tokens for training Large Language Models
36
Forks
2
Open Issues
6/11/2020
Last Commit
Recently added to the ecosystem