Feature-Set Experiments with OOP
Intermediate
Data Science Engineering
Created by Pavel
· 12.03.2026 at 07:54 UTC
· 2 completed
Problem setup:
A Data Science team has many candidate features but can train only a subset per run due to compute limits.
Object model:
- FeaturePool: all available features,
- ExperimentRun: selected feature subset, random seed, and score,
- optional tracker class for comparing runs.
Core operation:
sample_features(k) draws k distinct features from the pool.
Use cases:
- ablation studies,
- quick baseline screening,
- reproducible random search over feature subsets.
Edge cases:
- k larger than available features,
- duplicate feature names,
- non-reproducible runs when seed is not recorded.
University approvals: 0
Tasks
Card Info
- Topic: Data Science Engineering
- Difficulty: Intermediate
- Completed: 2 users
Creator
Pavel