> During quantization we find that values in the network vary from 0->5000, but ... | Hacker News

Hacker News .hnnew | past | comments | ask | show | jobs | submit

		sampo on July 25, 2023 \| parent \| context \| favorite \| on: Attention Is Off By One > During quantization we find that values in the network vary from 0->5000, but 95% of values are <100. Quantizing this to 8bits would mean that our values would be in increments of about 20. Instead of using an 8bit integer with even step size quantification, wouldn't they still use an 8bit float?

zamalek on July 25, 2023 [–]

Possibly, it depends on the distribution of the vales. It would also make my examples far less straightforward :)

Either way you would still only have 256 discrete values.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact