AWS SageMaker Introduces GPU Capacity Reservations for AI Inference Endpoints
Data scientists can now pre-allocate dedicated p-family GPU capacity for SageMaker AI inference endpoints, streamlining model evaluation and deployment with guaranteed resources.