R1 is probably not 6x cheaper than o1-mini and 30x cheaper than o1 in terms of actual underlying cost, meaning that DeepSeek probably charges a much lower gross margin on its API than OpenAI does. R1 has 37B active parameters (though its 671B total parameters are also relevant). We don’t know how many parameters o1-mini or o1 have, but IMO they’re probably a lot less than ~200B and ~1T, respectively, which is roughly what they would need to be for the price gap to reflect parameter counts alone (37B × 6 ≈ 222B and 37B × 30 ≈ 1.1T).
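To make the arithmetic behind the ~200B and ~1T figures explicit, here is a rough sketch in Python. It assumes, purely for illustration, that serving cost per token scales linearly with active parameters; that is a simplification, not something the pricing data confirms, and the function name is made up for this example.

```python
# Figures from the text: R1's active parameters (in billions) and the quoted price ratios.
R1_ACTIVE_PARAMS_B = 37
PRICE_RATIO = {"o1-mini": 6, "o1": 30}


def implied_active_params_b(price_ratio, baseline_b=R1_ACTIVE_PARAMS_B):
    """Active parameters (billions) a model would need if its price gap with R1
    were explained by active-parameter count alone.

    Assumption (hypothetical): cost per token is proportional to active parameters.
    """
    return baseline_b * price_ratio


implied = {name: implied_active_params_b(ratio) for name, ratio in PRICE_RATIO.items()}

for name, params_b in implied.items():
    print(f"{name}: ~{params_b}B active parameters implied by the price ratio")
```

If the real models are much smaller than these implied sizes, the price gap overstates the cost gap, and the remainder would have to come from something else, such as margin.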