Pretty sure MosaicML also does this but I haven't used their offering.
https://www.amazon.science/blog/scaling-to-trillion-paramete...
Pretty sure MosaicML also does this but I haven't used their offering.
https://www.amazon.science/blog/scaling-to-trillion-paramete...