Operation Issues#
Common issues with Map, Vanilla, YQL, S3, Docker, and Checkpoint operations.
Map Operation Issues#
Mapper script not found#
Error:
FileNotFoundError: stages/my_stage/src/mapper.py not found
Solution:
Verify
src/mapper.pyexistsCheck file name is exactly
mapper.pyEnsure file is in correct location
Mapper script errors#
Error:
SyntaxError in mapper.py
Solution:
Check Python syntax
Verify imports are correct
Test mapper script locally
Review error messages
Input table not found#
Error:
FileNotFoundError: Input table not found
Solution:
Verify input table path is correct
Check table exists (use
yt_client.exists())Ensure previous stage created the table
Review table path in config
Output table creation fails#
Error:
Error: Cannot create output table
Solution:
Check YT permissions
Verify output path is correct
Ensure parent directory exists
Check disk space on cluster
Vanilla Operation Issues#
Vanilla script not found#
Error:
FileNotFoundError: stages/my_stage/src/vanilla.py not found
Solution:
Verify
src/vanilla.pyexistsCheck file name is exactly
vanilla.pyEnsure file is in correct location
Vanilla script errors#
Error:
RuntimeError: Vanilla operation failed
Solution:
Check script syntax
Verify script has
if __name__ == "__main__": main()blockReview operation logs
Test script locally
YQL Operation Issues#
Join fails#
Error:
Error: Join operation failed
Solution:
Check column names match
Verify table schemas are compatible
Ensure tables exist
Review join configuration
Filter condition error#
Error:
SyntaxError: Invalid filter condition
Solution:
Use proper SQL-like syntax
Escape special characters
Check column names exist
Verify condition syntax
Aggregation fails#
Error:
Error: Aggregation operation failed
Solution:
Verify column types are numeric (for sum/avg)
Check column names exist
Ensure group_by columns exist
Review aggregation configuration
S3 Integration Issues#
S3 client creation fails#
Error:
Error: Failed to create S3 client
Solution:
Check AWS credentials in
secrets.envVerify credentials have S3 access
Check AWS region is correct
Review credential format
S3 files not found#
Error:
Warning: No files found in S3
Solution:
Verify bucket name is correct
Check prefix path is correct
Ensure files exist in S3
Review S3 permissions
S3 permission denied#
Error:
PermissionError: Access denied to S3 bucket
Solution:
Check IAM permissions for S3 access
Verify credentials have read/list permissions
Check bucket policy
Review AWS credentials
Docker Issues#
Docker image not found#
Error:
Error: Docker image not found
Solution:
Check image name and tag
Verify image exists in registry
Check Docker authentication
Review image path
Platform mismatch#
Error:
Error: Platform mismatch
Solution:
Build for
linux/amd64platformUse
docker buildxfor cross-platform buildsVerify image platform compatibility
GPU not available#
Error:
Error: GPU not available
Solution:
Verify GPU-enabled image
Check
gpu_limitis setEnsure cluster has GPU nodes
Review GPU resource allocation
Checkpoint Issues#
Checkpoint not found#
Error:
FileNotFoundError: Required checkpoint not found in YT
Solution:
Verify
checkpoint_basepath existsCheck
model_namematches filenameEnsure checkpoint was uploaded
Check YT permissions
Checkpoint upload fails#
Error:
Error: Failed to upload checkpoint
Solution:
Check
local_checkpoint_pathexistsVerify file permissions
Check YT credentials
Review upload logs
Checkpoint format error#
Error:
Error: Invalid checkpoint format
Solution:
Verify checkpoint format (PyTorch, etc.)
Check model loading code
Review checkpoint creation process
Test checkpoint loading locally
See Also#
Map Operations - Map operation guide
Vanilla Operations - Vanilla operation guide
YQL Operations - YQL operation guide
S3 Operations - S3 operation guide
Checkpoints - Checkpoint management guide
Docker Guide - Docker configuration guide