The BASIS technique reduces the scaling of activation memory, freeing GPU resources for larger models.