home
Madison Howard
[email protected]
Profile
Inbox
Activity
Setting
Sing out
?>
timeline
about
photos
friends
more
edit profile
share
Share Your Mood
×
eyio
2025-01-27 20:02:50
copy link to adda
edit post
embed adda
[D] How exactly did Deepseek R1 achieve massive training cost reductions, most posts I read are about its performance, RL, chain of thought, etc, but it’s not clear how the cost of training of the model was brought down so drastically
You and 201 people like this
201
41
07