Spaces:

openenv-community
/

test-local-nested-envs

Sleeping

App Files Files Community

test-local-nested-envs

Ctrl+K

Ctrl+K

4 contributors

History: 67 commits

KarlLearnsAI's picture

Upload minimum_training_script.ipynb

37d5368 verified 3 months ago

assets
Upload architecture.png 3 months ago
layer0
Improve reward function to break refuse-everything local minimum and scale training 3 months ago
layer1
Pre-format SFT dataset as text column, drop formatting_func 3 months ago
layer2
Switch Llama 3.1 8B to ungated unsloth mirror 3 months ago
personas
Clean up dead code, unused imports, and move hardcoded values to config.yaml 3 months ago
scripts
Make Supabase uploads incremental — upload after every step 3 months ago
tests
Improve reward function to break refuse-everything local minimum and scale training 3 months ago
.gitattributes

60 Bytes
Upload assets/architecture.png with huggingface_hub 3 months ago
.gitignore

126 Bytes
Implement self-improving AI oversight system with nested RL environments 3 months ago
Dockerfile

199 Bytes
Add supabase to Dockerfile pip install 3 months ago
README.md

18.3 kB
Add HF Spaces config metadata to README 3 months ago
app.py

4.83 kB
Upload app.py with huggingface_hub 3 months ago
config.yaml

4.31 kB
Increase training scale: more steps, episodes, and SFT epochs 3 months ago
config_loader.py

5.69 kB
Add SFT warm start before GRPO and DB connectivity init check 3 months ago
minimum_training_script.ipynb

261 kB
Upload minimum_training_script.ipynb 3 months ago
pyproject.toml

917 Bytes
Move supabase to core dependencies 3 months ago
train.sh

1.04 kB
Add train.sh startup script and assets folder 3 months ago