RedPajama v2 Open Dataset with 30T Tokens for Training LLMs
📰 Hacker News · programd
RedPajama v2 Open Dataset with 30T Tokens for Training LLMs. 60 comments, 236 points on Hacker News.