Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
openenv-community
/
test-local-nested-envs
like
0
Running
on
T4
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
test-local-nested-envs
562 kB
Ctrl+K
Ctrl+K
4 contributors
History:
67 commits
KarlLearnsAI
Upload minimum_training_script.ipynb
37d5368
verified
about 1 month ago
assets
Upload architecture.png
about 1 month ago
layer0
Improve reward function to break refuse-everything local minimum and scale training
about 1 month ago
layer1
Pre-format SFT dataset as text column, drop formatting_func
about 1 month ago
layer2
Switch Llama 3.1 8B to ungated unsloth mirror
about 1 month ago
personas
Clean up dead code, unused imports, and move hardcoded values to config.yaml
about 1 month ago
scripts
Make Supabase uploads incremental — upload after every step
about 1 month ago
tests
Improve reward function to break refuse-everything local minimum and scale training
about 1 month ago
.gitattributes
Safe
60 Bytes
Upload assets/architecture.png with huggingface_hub
about 1 month ago
.gitignore
Safe
126 Bytes
Implement self-improving AI oversight system with nested RL environments
about 1 month ago
Dockerfile
Safe
199 Bytes
Add supabase to Dockerfile pip install
about 1 month ago
README.md
Safe
18.3 kB
Add HF Spaces config metadata to README
about 1 month ago
app.py
Safe
4.83 kB
Upload app.py with huggingface_hub
about 1 month ago
config.yaml
Safe
4.31 kB
Increase training scale: more steps, episodes, and SFT epochs
about 1 month ago
config_loader.py
Safe
5.69 kB
Add SFT warm start before GRPO and DB connectivity init check
about 1 month ago
minimum_training_script.ipynb
Safe
261 kB
Upload minimum_training_script.ipynb
about 1 month ago
pyproject.toml
Safe
917 Bytes
Move supabase to core dependencies
about 1 month ago
train.sh
Safe
1.04 kB
Add train.sh startup script and assets folder
about 1 month ago