[07/27/25 11:25:28] INFO ['prompt']: 'What is an LLM?' trainer.py:790
INFO ['response']: trainer.py:794
What is an
LLM?wZbbbT'3weew,'foBB.qWWlpwes.qqQevFAA.bbvFF-AkacWWfYhx3fooB'''';vvee
sppWW
eeWA3ZZppPZe;dCCvres
;ecc--Ws'cqor,JZVVVCCeepfqqWxApBBBBhh;;JeQhMMss,,wshrhW?BiMWYqqwwwAASSw
rrroo,rqtWseMq.Ak'ofA,,'t,,..hh;xx'?sAq';cqxrqWkeMqt'gzAAxhrpqt'g't;?bt
oseq-pqq'qAtttt,eqrM
[07/27/25 11:25:31] INFO Saving checkpoint to: trainer.py:733
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example
INFO Saving model to: trainer.py:734
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example/model.pth
INFO Appending configs.py:141
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example to
/Users/samforeman/projects/saforem2/wordplay/src/ckpts/checkpoints.log
INFO step=510 loss=4.25518 dt=0.0193091 dtf=0.0189602 dtb=0.000122416 trainer.py:850
sps=51.789 sps_per_gpu=51.789 tps=6628.99 tps_per_gpu=6628.99
mfu=0.143245
INFO step=520 loss=4.20906 dt=0.0182869 dtf=0.0179924 dtb=0.000112625 trainer.py:850
sps=54.684 sps_per_gpu=54.684 tps=6999.56 tps_per_gpu=6999.56
mfu=0.144046
[07/27/25 11:25:32] INFO step=530 loss=4.22394 dt=0.0183378 dtf=0.0179662 dtb=0.000141666 trainer.py:850
sps=54.5322 sps_per_gpu=54.5322 tps=6980.12 tps_per_gpu=6980.12
mfu=0.144724
INFO step=540 loss=4.23923 dt=0.018275 dtf=0.0179809 dtb=0.000123958 trainer.py:850
sps=54.7196 sps_per_gpu=54.7196 tps=7004.1 tps_per_gpu=7004.1
mfu=0.145387
INFO step=550 loss=4.24928 dt=0.0200772 dtf=0.0197448 dtb=0.000128708 trainer.py:850
sps=49.8077 sps_per_gpu=49.8077 tps=6375.39 tps_per_gpu=6375.39
mfu=0.144625
[07/27/25 11:25:34] INFO ['prompt']: 'What is an LLM?' trainer.py:790
INFO ['response']: trainer.py:794
What is an
LLM?wboG',ZZswPZZhsf'V.h;QrppwAfAa''qWWYYfOOx33fvkkfQ'elccB3kkkm....swe
vfsssoAkfQss
'f;ehewqs3--seuCeerqfQA,XXqooU;?';QhdI'M;;astc;W;?A;p;p',,'''gosS;;WW?'
errs'fwwr''qqWW,w'l;''www''tppwbQWWseSSqYtLtSbQQQ'q;qqM'tbqW,s'r.AAtcbb
q-'ttuuA,;;;Q'S;;;ttMglqYetqeSS;Wq
[07/27/25 11:25:37] INFO Saving checkpoint to: trainer.py:733
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example
INFO Saving model to: trainer.py:734
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example/model.pth
INFO Appending configs.py:141
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example to
/Users/samforeman/projects/saforem2/wordplay/src/ckpts/checkpoints.log
INFO step=560 loss=4.21979 dt=0.0185737 dtf=0.0182987 dtb=0.000109708 trainer.py:850
sps=53.8395 sps_per_gpu=53.8395 tps=6891.46 tps_per_gpu=6891.46
mfu=0.145054
[07/27/25 11:25:38] INFO step=570 loss=4.27896 dt=0.018959 dtf=0.0185998 dtb=0.000151583 trainer.py:850
sps=52.7454 sps_per_gpu=52.7454 tps=6751.41 tps_per_gpu=6751.41
mfu=0.145138
INFO step=580 loss=4.25036 dt=0.0188471 dtf=0.0184447 dtb=0.00018775 trainer.py:850
sps=53.0586 sps_per_gpu=53.0586 tps=6791.5 tps_per_gpu=6791.5
mfu=0.1453
INFO step=590 loss=4.30325 dt=0.021447 dtf=0.0210627 dtb=0.0001295 trainer.py:850
sps=46.6266 sps_per_gpu=46.6266 tps=5968.2 tps_per_gpu=5968.2
mfu=0.143666
INFO step=600 loss=4.24977 dt=0.0181719 dtf=0.0174561 dtb=0.000136083 trainer.py:850
sps=55.03 sps_per_gpu=55.03 tps=7043.84 tps_per_gpu=7043.84 mfu=0.14452
[07/27/25 11:25:40] INFO ['prompt']: 'What is an LLM?' trainer.py:790
INFO ['response']: trainer.py:794
What is an LLM?LQ3vvye! wePZ
ewbAII''QYUfY.vTcaQlccCfhsZblYe''vS'xqosfoxCx'q33ckkxpppcecZZ-caqAb''fQ
-eqb'.AGGGZZ?--s..h.ttppMq3ZQs,e';pwsf..se;;pqtcenr'.nxnqqgbqQYtttM'fSb
ttcqqqqgYYjjrqfAkkSSSuQqoh'''S;SYYYAG;SSSo'QQQuu;'QSfqo'.tgSggkqWYYbbvq
qtuiqrhS;QC'QSrSbWWSJJeuuiWYu
[07/27/25 11:25:43] INFO Saving checkpoint to: trainer.py:733
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example
INFO Saving model to: trainer.py:734
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example/model.pth
INFO Appending configs.py:141
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example to
/Users/samforeman/projects/saforem2/wordplay/src/ckpts/checkpoints.log
INFO step=610 loss=4.27699 dt=0.0194192 dtf=0.019049 dtb=0.000122208 trainer.py:850
sps=51.4955 sps_per_gpu=51.4955 tps=6591.43 tps_per_gpu=6591.43
mfu=0.144312
INFO step=620 loss=4.2417 dt=0.0203904 dtf=0.0201204 dtb=0.000116084 trainer.py:850
sps=49.0427 sps_per_gpu=49.0427 tps=6277.47 tps_per_gpu=6277.47
mfu=0.143445
[07/27/25 11:25:44] INFO step=630 loss=4.1949 dt=0.0202023 dtf=0.0199125 dtb=0.000115 trainer.py:850
sps=49.4992 sps_per_gpu=49.4992 tps=6335.9 tps_per_gpu=6335.9
mfu=0.142792
INFO step=640 loss=4.21554 dt=0.0184285 dtf=0.0181117 dtb=0.000119542 trainer.py:850
sps=54.2639 sps_per_gpu=54.2639 tps=6945.78 tps_per_gpu=6945.78
mfu=0.143522
INFO step=650 loss=4.26643 dt=0.0191115 dtf=0.018803 dtb=0.000116417 trainer.py:850
sps=52.3245 sps_per_gpu=52.3245 tps=6697.54 tps_per_gpu=6697.54
mfu=0.143642
[07/27/25 11:25:46] INFO ['prompt']: 'What is an LLM?' trainer.py:790
INFO ['response']: trainer.py:794
What is an
LLM?qadZ--e'ovTqro'qE'rpAYvrr;qo3AAwUA-sG..qqbaNNyyep;blgWVe''tkaoo,ebq
qUAAAAxttmZS.tGlAxxtccZAk'qffhMM;hqcZ
'rvsoAAtqWtt,'MqWtt'qqqQ--zpttttuq3brqtrrha;WW'eq;cqqqqrrhh-ppq;'SSJrhS
YSJqg'',asqqAhdqbv'?Bqqqb',fqSqt'QqAAWAAqqQQQttttIffvqeWYY--?MfSpppMttt
tBBM'KK..
[07/27/25 11:25:49] INFO Saving checkpoint to: trainer.py:733
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example
INFO Saving model to: trainer.py:734
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example/model.pth
INFO Appending configs.py:141
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example to
/Users/samforeman/projects/saforem2/wordplay/src/ckpts/checkpoints.log
INFO step=660 loss=4.17238 dt=0.0189814 dtf=0.0186691 dtb=0.000131375 trainer.py:850
sps=52.6832 sps_per_gpu=52.6832 tps=6743.45 tps_per_gpu=6743.45
mfu=0.14385
[07/27/25 11:25:50] INFO step=670 loss=4.33205 dt=0.0193104 dtf=0.0189986 dtb=0.000128042 trainer.py:850
sps=51.7856 sps_per_gpu=51.7856 tps=6628.56 tps_per_gpu=6628.56
mfu=0.143789
INFO step=680 loss=4.17701 dt=0.0183742 dtf=0.0180271 dtb=0.000151375 trainer.py:850
sps=54.4241 sps_per_gpu=54.4241 tps=6966.29 tps_per_gpu=6966.29
mfu=0.144463
INFO step=690 loss=4.23023 dt=0.0177905 dtf=0.0175473 dtb=9.91249e-05 trainer.py:850
sps=56.2098 sps_per_gpu=56.2098 tps=7194.85 tps_per_gpu=7194.85
mfu=0.145564
INFO step=700 loss=4.19011 dt=0.0194102 dtf=0.0188519 dtb=0.000118375 trainer.py:850
sps=51.5194 sps_per_gpu=51.5194 tps=6594.48 tps_per_gpu=6594.48
mfu=0.145257
[07/27/25 11:25:52] INFO ['prompt']: 'What is an LLM?' trainer.py:790
INFO ['response']: trainer.py:794
What is an LLM?lrvqqrafQEsA,hrccZZ;'rrkf'c x'Xxqad.SSxtaV!XQUxv;a.'g
Zto..herovV-qA'K;aZs3ecAq
vqq.!c'fos,ssAAcqfop-;AA.Ag.WYYvvqttxW,,eq;;..Mww';QtMMgqeeqYYppppp;;..
MW'tqYf.ff';ccWYrrS'SAsSohegQrr'rhWSASpgj'.A;;.eqqqqqeWWofYQYtcb'Q;;;tt
tuqcgk;.t3tSbYhhouI;ppp;tSfvgQSuSq
[07/27/25 11:25:55] INFO Saving checkpoint to: trainer.py:733
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example
INFO Saving model to: trainer.py:734
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example/model.pth
INFO Appending configs.py:141
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example to
/Users/samforeman/projects/saforem2/wordplay/src/ckpts/checkpoints.log
INFO step=710 loss=4.25752 dt=0.0197687 dtf=0.0193927 dtb=0.000144125 trainer.py:850
sps=50.585 sps_per_gpu=50.585 tps=6474.88 tps_per_gpu=6474.88
mfu=0.144723
[07/27/25 11:25:56] INFO step=720 loss=4.22592 dt=0.0186651 dtf=0.0175268 dtb=0.0001345 trainer.py:850
sps=53.5759 sps_per_gpu=53.5759 tps=6857.71 tps_per_gpu=6857.71
mfu=0.14507
INFO step=730 loss=4.18346 dt=0.0178852 dtf=0.017587 dtb=0.000127 trainer.py:850
sps=55.9123 sps_per_gpu=55.9123 tps=7156.77 tps_per_gpu=7156.77
mfu=0.146028
INFO step=740 loss=4.22937 dt=0.018805 dtf=0.0184613 dtb=0.000150958 trainer.py:850
sps=53.1772 sps_per_gpu=53.1772 tps=6806.69 tps_per_gpu=6806.69
mfu=0.146133
INFO step=750 loss=4.22004 dt=0.0185913 dtf=0.0181662 dtb=0.000108125 trainer.py:850
sps=53.7887 sps_per_gpu=53.7887 tps=6884.96 tps_per_gpu=6884.96
mfu=0.146398
[07/27/25 11:25:58] INFO ['prompt']: 'What is an LLM?' trainer.py:790
INFO ['response']: trainer.py:794
What is an LLM?.AvexhjjsAxx3AAAAffyyY'rr.AxZZpaff.yykfAqYEZ
'koBf''3YYo.hzA,aaqbbZ
ttQhhxkeQU'qhqqoqq!!'ffor'f.aZPeG'qW.ttvafA-b??fffvfvYrcL.bWtSS??qtLtQu
tohdyyppu''rrSqYqc'KKye''''gjjQq'fgJq;;.'gYqrkssW'tp;bqqf.qowqoMM'qQQSq
qWssgyttu?qoo'ff''kkSSffAr.MggesgIIBBYeeWqqqqg
[07/27/25 11:26:01] INFO Saving checkpoint to: trainer.py:733
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example
INFO Saving model to: trainer.py:734
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example/model.pth
INFO Appending configs.py:141
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example to
/Users/samforeman/projects/saforem2/wordplay/src/ckpts/checkpoints.log
INFO step=760 loss=4.16349 dt=0.0194697 dtf=0.0191296 dtb=0.000143083 trainer.py:850
sps=51.3619 sps_per_gpu=51.3619 tps=6574.33 tps_per_gpu=6574.33
mfu=0.145964
INFO step=770 loss=4.22062 dt=0.0193039 dtf=0.018953 dtb=0.0001385 trainer.py:850
sps=51.803 sps_per_gpu=51.803 tps=6630.78 tps_per_gpu=6630.78
mfu=0.145696
[07/27/25 11:26:02] INFO step=780 loss=4.16916 dt=0.0171542 dtf=0.0168228 dtb=0.000155208 trainer.py:850
sps=58.2949 sps_per_gpu=58.2949 tps=7461.74 tps_per_gpu=7461.74
mfu=0.147251
INFO step=790 loss=4.21405 dt=0.0176518 dtf=0.0173884 dtb=0.000118 trainer.py:850
sps=56.6515 sps_per_gpu=56.6515 tps=7251.39 tps_per_gpu=7251.39
mfu=0.148195
INFO step=800 loss=4.23569 dt=0.037451 dtf=0.0371191 dtb=0.000127167 trainer.py:850
sps=26.7016 sps_per_gpu=26.7016 tps=3417.8 tps_per_gpu=3417.8
mfu=0.140761
[07/27/25 11:26:04] INFO ['prompt']: 'What is an LLM?' trainer.py:790
INFO ['response']: trainer.py:794
What is an
LLM??.ahoskZqeofpQe'v;.p..hqYwqaarswbbc.ahwbkkA''KyhvX.yp'Vc3;oseo.xeee
aa'WQqfhKKfYqqqf.x33xx--;;;.egMcc-qaaovvKKOsvSpwesfgI;;wwerpMgtcgQsb;uQ
tggyyptokyy';QCy;;asoW,,Jr''''',AkkfYoAAAAAS::::;;.bWttqeqcbA::gYJJbqgj
oBhopwe;.s''ggkk'qk.qkGWYYyqqe;''Sbs'MM;;.qqqqQ
[07/27/25 11:26:07] INFO Saving checkpoint to: trainer.py:733
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example
INFO Saving model to: trainer.py:734
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example/model.pth
INFO Appending configs.py:141
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example to
/Users/samforeman/projects/saforem2/wordplay/src/ckpts/checkpoints.log
INFO step=810 loss=4.22317 dt=0.0203105 dtf=0.0199397 dtb=0.000126916 trainer.py:850
sps=49.2356 sps_per_gpu=49.2356 tps=6302.16 tps_per_gpu=6302.16
mfu=0.140303
[07/27/25 11:26:08] INFO step=820 loss=4.24584 dt=0.0213863 dtf=0.0210762 dtb=0.000128834 trainer.py:850
sps=46.7589 sps_per_gpu=46.7589 tps=5985.14 tps_per_gpu=5985.14
mfu=0.139206
INFO step=830 loss=4.1855 dt=0.0176513 dtf=0.0172706 dtb=0.000152417 trainer.py:850
sps=56.6529 sps_per_gpu=56.6529 tps=7251.58 tps_per_gpu=7251.58
mfu=0.140955
INFO step=840 loss=4.24083 dt=0.018392 dtf=0.0180307 dtb=0.0001385 trainer.py:850
sps=54.3716 sps_per_gpu=54.3716 tps=6959.56 tps_per_gpu=6959.56
mfu=0.141898
INFO step=850 loss=4.23785 dt=0.0192448 dtf=0.0189111 dtb=0.000127 trainer.py:850
sps=51.9622 sps_per_gpu=51.9622 tps=6651.16 tps_per_gpu=6651.16
mfu=0.142081
[07/27/25 11:26:10] INFO ['prompt']: 'What is an LLM?' trainer.py:790
INFO ['response']: trainer.py:794
What is an
LLM?A;QfqrqQ'xxx'aa.hh3vv''wwossqZse'rxfQsseh'.evrpMq''.xxTUeQ'''rqqaxf
xtcbqcf3qq3jZbvcepwA,,,ff'hpqcpcA-A'rv::errrvbbZ:pc-qycSScWlbQYhhwwAA-S
QCgl;bbrpbSrrrrqqqqq''rWqqtcAkYyqgYtxttttbkkqQWWqaqqqkkk,'qqexrrWSSqyyY
j'SyyQYQQ,q''p'---p''tcqzhhhpqWfs.p'foBqqQt::eu
[07/27/25 11:26:13] INFO Saving checkpoint to: trainer.py:733
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example
INFO Saving model to: trainer.py:734
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example/model.pth
[07/27/25 11:26:14] INFO Appending configs.py:141
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example to
/Users/samforeman/projects/saforem2/wordplay/src/ckpts/checkpoints.log
INFO step=860 loss=4.20116 dt=0.0179678 dtf=0.0176636 dtb=0.000124459 trainer.py:850
sps=55.655 sps_per_gpu=55.655 tps=7123.84 tps_per_gpu=7123.84
mfu=0.143267
INFO step=870 loss=4.22428 dt=0.0205305 dtf=0.0186659 dtb=0.000150667 trainer.py:850
sps=48.7079 sps_per_gpu=48.7079 tps=6234.61 tps_per_gpu=6234.61
mfu=0.142412
INFO step=880 loss=4.22977 dt=0.0189898 dtf=0.018688 dtb=0.00011875 trainer.py:850
sps=52.6599 sps_per_gpu=52.6599 tps=6740.46 tps_per_gpu=6740.46
mfu=0.142737
INFO step=890 loss=4.22047 dt=0.0202268 dtf=0.0199305 dtb=0.0001135 trainer.py:850
sps=49.4395 sps_per_gpu=49.4395 tps=6328.25 tps_per_gpu=6328.25
mfu=0.142137
INFO step=900 loss=4.35563 dt=0.019475 dtf=0.0189142 dtb=0.000115833 trainer.py:850
sps=51.348 sps_per_gpu=51.348 tps=6572.54 tps_per_gpu=6572.54
mfu=0.142126
[07/27/25 11:26:16] INFO ['prompt']: 'What is an LLM?' trainer.py:790
INFO ['response']: trainer.py:794
What is an
LLM?wwPA'eeew-3ZAjRwqs33eafCq'ax..xcxc''awA',bsettcCvCqqq33A-.bsor.awQf
J$ 3a-3b U' Zq3gQQf',,AqGZ
fhhPwU.vfCC.xpqvr.SkkofxsyQrrs';'kGs,rMse''rppb'qqfoktM'qo,qqSqgW,etM'M
??Z;auYfSSo??gg'sSvSQQqfftcb;;;;pWQSffttqgQSSSkllbrqqaw,'SqqYQ;;;pqqtpB
heW;;;.hn'qYyMMesgl
[07/27/25 11:26:19] INFO Saving checkpoint to: trainer.py:733
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example
INFO Saving model to: trainer.py:734
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example/model.pth
[07/27/25 11:26:21] INFO Appending configs.py:141
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example to
/Users/samforeman/projects/saforem2/wordplay/src/ckpts/checkpoints.log
INFO step=910 loss=4.19569 dt=0.0184239 dtf=0.0181274 dtb=0.000126584 trainer.py:850
sps=54.2774 sps_per_gpu=54.2774 tps=6947.51 tps_per_gpu=6947.51
mfu=0.142926
INFO step=920 loss=4.23206 dt=0.0189052 dtf=0.0186322 dtb=0.00011175 trainer.py:850
sps=52.8955 sps_per_gpu=52.8955 tps=6770.62 tps_per_gpu=6770.62
mfu=0.143264
[07/27/25 11:26:22] INFO step=930 loss=4.29058 dt=0.0204312 dtf=0.0200622 dtb=0.0001525 trainer.py:850
sps=48.9446 sps_per_gpu=48.9446 tps=6264.91 tps_per_gpu=6264.91
mfu=0.142476
INFO step=940 loss=4.211 dt=0.0308806 dtf=0.0188316 dtb=0.000154834 trainer.py:850
sps=32.3828 sps_per_gpu=32.3828 tps=4145 tps_per_gpu=4145 mfu=0.137185
INFO step=950 loss=4.18626 dt=0.0178002 dtf=0.0175009 dtb=0.000114584 trainer.py:850
sps=56.179 sps_per_gpu=56.179 tps=7190.91 tps_per_gpu=7190.91
mfu=0.139005
[07/27/25 11:26:24] INFO ['prompt']: 'What is an LLM?' trainer.py:790
INFO ['response']: trainer.py:794
What is an
LLM?YfQooooRx3xccaHCvj3gllexpjGG,wUxe'oOf.smxxxrq-jj'kxxrkc3fkkeQZZe''Y
R'JhrZZAcowccpqA,QUJZpcAkkGGGqp--.v'appbYYbeeqbbZrk'MBfq-srksqYee'QQt'J
',qWqt;qkGWbrrtqJ-'pa'ggjJSq--'sf'..;''aqfpfx'Sbbq3tooMbb?',AA-AW'MqAAk
;ccAGqQqaA;WQhMSq;cffho,eWohpWott3jj---s;?ggIIS
[07/27/25 11:26:27] INFO Saving checkpoint to: trainer.py:733
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example
INFO Saving model to: trainer.py:734
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example/model.pth
INFO Appending configs.py:141
/Users/samforeman/projects/saforem2/intro-hpc-bootcamp-2025/content/02-
llms/07-shakespeare-example to
/Users/samforeman/projects/saforem2/wordplay/src/ckpts/checkpoints.log
INFO step=960 loss=4.225 dt=0.0210933 dtf=0.0207466 dtb=0.00012575 trainer.py:850
sps=47.4083 sps_per_gpu=47.4083 tps=6068.27 tps_per_gpu=6068.27
mfu=0.138218
[07/27/25 11:26:28] INFO step=970 loss=4.17741 dt=0.0178491 dtf=0.0175596 dtb=0.000125458 trainer.py:850
sps=56.0252 sps_per_gpu=56.0252 tps=7171.22 tps_per_gpu=7171.22
mfu=0.139892
INFO step=980 loss=4.1707 dt=0.0166487 dtf=0.0163776 dtb=0.000111583 trainer.py:850
sps=60.0647 sps_per_gpu=60.0647 tps=7688.28 tps_per_gpu=7688.28
mfu=0.142516
INFO step=990 loss=4.1891 dt=0.0180315 dtf=0.0177192 dtb=0.000119167 trainer.py:850
sps=55.4585 sps_per_gpu=55.4585 tps=7098.69 tps_per_gpu=7098.69
mfu=0.143604
INFO step=1000 loss=4.2423 dt=0.022806 dtf=0.0224982 dtb=0.000120917 trainer.py:850
sps=43.8482 sps_per_gpu=43.8482 tps=5612.57 tps_per_gpu=5612.57
mfu=0.141372