Commit Graph

24 Commits

Author SHA1 Message Date
Arthur Lu
3595fc1b07 Merge pull request #2 from ltcptgeneral/gym-2: add ppo model, add cpu and cuda device support 2024-03-20 21:18:19 -07:00
Arthur Lu
4f8ca4aa06 add ppo model, add cpu and cuda device support 2024-03-20 21:17:44 -07:00
Arthur Lu
2244de94ac Merge pull request #1 from ltcptgeneral/gym-2: Gym 2 2024-03-20 21:07:19 -07:00
Arthur Lu
b46d335044 working!! 2024-03-20 21:01:08 -07:00
Ethan Shapiro
284a29d7af f 2024-03-20 19:53:50 -07:00
Ethan Shapiro
3747af9d22 added state saving 2024-03-20 19:52:13 -07:00
Arthur Lu
4fb81317f0 add letter_guess symlink, add model loading into ai.py 2024-03-20 17:31:27 -07:00
Arthur Lu
12601964bd add eval script for convenience 2024-03-20 12:59:14 -07:00
Arthur Lu
c448e02512 add evaluation to eric's wordle solver (eval.py) 2024-03-20 12:53:40 -07:00
Arthur Lu
848d385482 run model train, abt 3 avg reward 2024-03-20 12:18:15 -07:00
Arthur Lu
f40301cac9 delete gym-wordle, fix some issues in letter_guess gym, add wandb integration 2024-03-19 16:49:01 -07:00
Ethan Shapiro
fc197acb6e started new letter guess environment 2024-03-19 11:52:10 -07:00
Ethan Shapiro
e799c14ece new reward scheme 2024-03-18 11:25:14 -07:00
Ethan Shapiro
bbe9a1891c updated wordle to gymnasium env 2024-03-15 18:19:58 -07:00
Arthur Lu
9172326013 upload wordle env, fix indexing issue in wordle env, attempt to improve reward (no improvement) 2024-03-14 16:47:11 -07:00
Arthur Lu
4836be8121 remove debug prints 2024-03-14 15:00:19 -07:00
Arthur Lu
5672169073 copy the wordle env locally and fix the obs return 2024-03-14 14:49:17 -07:00
Arthur Lu
5ec123e0f1 minor changes 2024-03-13 13:57:23 -07:00
Arthur Lu
e9622b6f68 switch to notebook 2024-03-13 11:04:30 -07:00
ltcptgeneral
83e81722d2 this should probably be working but isn't 2024-03-12 22:14:03 -07:00
ltcptgeneral
320f2f81b7 delete tests 2024-03-12 21:42:59 -07:00
ltcptgeneral
c121415e31 save plot 2024-03-07 00:54:47 -08:00
ltcptgeneral
6cbffcdec1 feasible training times by dropping to bert-base 2024-03-06 21:48:09 -08:00
ltcptgeneral
6a4027afb7 minimal example of bert idea 2024-03-06 21:00:38 -08:00