Feature Subset Optimization II: Heuristics, Baselines, and Validation Discipline
Advanced
Combinatorial Search in DS
Created by Pavel
· 12.03.2026 at 07:54 UTC
· 1 completed
When exhaustive search is too expensive, a common practical approach is greedy forward selection.
Forward-selection loop:
1. start with empty set,
2. evaluate adding each remaining feature,
3. add the feature with best score gain,
4. stop by criterion (max features, no improvement, budget).
Why baseline comparison matters:
- compare greedy search against random subset sampling,
- otherwise observed gains may be due to chance.
Validation discipline:
- use fixed split protocol or cross-validation,
- keep selection and evaluation separated to avoid leakage,
- test stability across random seeds.
Edge cases:
- local optimum traps,
- noisy score function,
- ties between candidate features.
University approvals: 0
Tasks
Card Info
- Topic: Combinatorial Search in DS
- Difficulty: Advanced
- Completed: 1 users
Creator
Pavel