Hacker News
thibaut_barrere on June 23, 2015 | on: How Akka Streams can be used to process the Wikida...
What are your favorite large, publicly available datasets?
Smerity on June 23, 2015
Biased reply (I'm a data scientist there): Common Crawl[1]. We build and maintain an open repository of web crawl data that anyone can access and analyze, completely free.
[1]: http://commoncrawl.org/
rcpt on June 24, 2015
This thread is pretty good: http://www.quora.com/Where-can-I-find-large-datasets-open-to...
nextos on June 23, 2015
The Cancer Genome Atlas, Ensembl, 1000Genomes.