
mirekrusin on May 16, 2023, on: StarCoder and StarCoderBase: 15.5B parameter model...

People should be training model sizes that fit and fill consumer GPUs, i.e. 2x 24G (dual GPU) ~ 28B model, 1x 24G ~ 14B model, etc.
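
A rough sketch of the arithmetic behind those sizes. The comment doesn't state a precision, so this assumes weights stored at 8 bits per parameter and reads "24G" as 24 GiB; both are my assumptions. It counts weights only (KV cache, activations and framework overhead are ignored), and the helper names are made up for illustration.

```python
GIB = 2**30  # assumption: "24G" is read as 24 GiB


def weight_gib(params: float, bits_per_param: int) -> float:
    """Memory needed for the weights alone, in GiB."""
    return params * bits_per_param / 8 / GIB


def fits(params: float, bits_per_param: int, total_vram_gib: float) -> bool:
    """True if the weights alone fit in the given total VRAM.

    Ignores KV cache, activations and framework overhead, so a real
    deployment needs some headroom on top of this.
    """
    return weight_gib(params, bits_per_param) <= total_vram_gib


if __name__ == "__main__":
    # (label, parameter count, total VRAM in GiB) taken from the comment
    for label, params, vram in [("1x 24G", 14e9, 24), ("2x 24G", 28e9, 48)]:
        for bits in (16, 8, 4):
            size = weight_gib(params, bits)
            verdict = "fits" if fits(params, bits, vram) else "does not fit"
            print(f"{label}, {params / 1e9:.0f}B @ {bits}-bit: {size:.1f} GiB, {verdict}")
```

Under these assumptions a 14B model at 8-bit needs about 13 GiB for weights, which leaves room on a 24G card, while the same model at 16-bit needs about 26 GiB and does not fit. So the ~14B-per-24G figure only works with weights at 8 bits or lower.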