Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
106.7
TFLOPS
3
1
84
YarikDev
yarikdevcom
Follow
maxgreco's profile picture
NikolayKozloff's profile picture
quackdoc's profile picture
3 followers
·
15 following
AI & ML interests
Discovering
Recent Activity
liked
a model
9 days ago
Tesslate/OmniCoder-9B-GGUF
liked
a model
11 days ago
Tesslate/OmniCoder-9B
reacted
to
ajibawa-2023
's
post
with 🔥
19 days ago
Cpp-Code-Large Dataset: https://huggingface.co/datasets/ajibawa-2023/Cpp-Code-Large Cpp-Code-Large is a large-scale corpus of C++ source code comprising more than 5 million lines of C++ code. The dataset is designed to support research in large language model (LLM) pretraining, code intelligence, software engineering automation, and static program analysis for the C++ ecosystem. By providing a high-volume, language-specific corpus, Cpp-Code-Large enables systematic experimentation in C++-focused model training, domain adaptation, and downstream code understanding tasks. Cpp-Code-Large addresses the need for a dedicated C++-only dataset at substantial scale, enabling focused research across systems programming, performance-critical applications, embedded systems, game engines, and large-scale native software projects.
View all activity
Organizations
None yet
models
2
Sort: Recently updated
yarikdevcom/Intern-S1-mini-GGUF
8B
•
Updated
Aug 23, 2025
•
17
yarikdevcom/Seed-OSS-36B-Instruct-GGUF
Text Generation
•
36B
•
Updated
Aug 23, 2025
•
169
•
18
datasets
0
None public yet