ripe-oyster-50903
07/24/2024, 8:53 AM
[...] python) and trying to install torch works, but only CPU-based; I haven't managed to make CUDA work so far.
What's the best practice here? The examples I found either didn't specify the base image or just installed torch with conda, which resulted in a CPU-only build when I tried it.
I am using Metaflow with --with batch, based on the standard CloudFormation template.
ripe-oyster-50903
07/24/2024, 10:54 AM
ripe-oyster-50903
07/24/2024, 10:55 AM
BatchLaunchTemplateMetaFlow:
  Type: AWS::EC2::LaunchTemplate
  Properties:
    LaunchTemplateName: "BatchLaunchTemplateMetaFlow"
    LaunchTemplateData:
      BlockDeviceMappings:
        - DeviceName: /dev/xvda
          Ebs:
            VolumeSize: 100
            VolumeType: gp2
ComputeEnvironment:
  Type: AWS::Batch::ComputeEnvironment
  DependsOn: BatchLaunchTemplateMetaFlow
  Properties:
    Type: MANAGED
    ServiceRole: !GetAtt 'BatchExecutionRole.Arn'
    ComputeResources:
      MaxvCpus: !Ref MaxVCPUBatch
      SecurityGroupIds:
        - !GetAtt VPC.DefaultSecurityGroup
      Type: !If [EnableFargateOnBatch, 'FARGATE', 'EC2']
      Subnets:
        - !Ref Subnet1
        - !Ref Subnet2
      MinvCpus: !If [EnableFargateOnBatch, !Ref AWS::NoValue, !Ref MinVCPUBatch]
      InstanceRole: !If [EnableFargateOnBatch, !Ref AWS::NoValue, !GetAtt 'ECSInstanceProfile.Arn']
      InstanceTypes: !If [EnableFargateOnBatch, !Ref AWS::NoValue, !Ref ComputeEnvInstanceTypes]
      DesiredvCpus: !If [EnableFargateOnBatch, !Ref AWS::NoValue, !Ref DesiredVCPUBatch]
      LaunchTemplate:
        LaunchTemplateId: !Ref BatchLaunchTemplateMetaFlow
    State: ENABLED
That was the relevant part of the CloudFormation template that I changed, in case someone has similar issues.
ancient-application-36103
07/24/2024, 4:13 PM
[...] pytorch from the pytorch conda channel? pytorch::pytorch as the package name will do the trick.
ripe-oyster-50903
07/24/2024, 4:18 PM
ancient-application-36103
07/24/2024, 4:19 PM
ripe-oyster-50903
07/24/2024, 9:19 PM
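
For anyone hitting the same CPU-only symptom, a quick check along these lines can show which torch build ended up in the container. This is only a minimal sketch: `cuda_report` is a hypothetical helper name (not part of Metaflow or PyTorch), and it assumes the code runs inside the Batch container, for example from a step that requests a GPU.

```python
import importlib.util


def cuda_report():
    """Summarize whether the installed torch build can see a GPU."""
    # find_spec avoids an ImportError when torch is not installed at all.
    if importlib.util.find_spec("torch") is None:
        return {"torch_installed": False, "cuda_build": None, "cuda_available": False}

    import torch

    return {
        "torch_installed": True,
        # torch.version.cuda is None for CPU-only builds.
        "cuda_build": torch.version.cuda,
        # False for CPU-only builds, or when no GPU/driver is visible in the container.
        "cuda_available": torch.cuda.is_available(),
    }


if __name__ == "__main__":
    print(cuda_report())
```

If `cuda_build` comes back as `None`, the installed package itself is a CPU-only build. If it is set but `cuda_available` is `False`, the more likely cause is that the job landed on an instance or container without a visible GPU.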