data-engineering

1 article

NVIDIA announces a suite of open datasets and training frameworks spanning robotics, autonomous vehicles, synthetic personas, protein modeling, and language model pre-training. The release totals over 2 petabytes of data across 180+ datasets and is intended to reduce AI development bottlenecks.

NVIDIA Nemotron GR00T HuggingFace GitHub Runway CrowdStrike NTT Data APTO AI Singapore WideLabs Oxford Mila CIFAR Andrej Karpathy
huggingface.co · gmays · 1 day ago