Machine Learning - Interview Questions
Beginner
- Bias-variance tradeoff: how do under/overfitting manifest and how do you address them? (See Model Selection)
- Data leakage: examples and prevention (pipelines, split before transform). (See Preprocessing)
- Metrics: accuracy vs F1 vs ROC-AUC vs PR-AUC; when to use each? (See Evaluation)
- Cross-validation: k-fold, stratified, group, time-series splits — when and why?
Intermediate
- Feature scaling/encoding with ColumnTransformer and leakage-safe target encoding.
- Class imbalance: thresholds, resampling, class weights, calibration.
- Hyperparameter search: grid vs randomized vs Bayesian; early stopping.
- Feature importance vs permutation importance; SHAP/LIME basics.
Advanced
- Fairness: equalized odds vs demographic parity; tradeoffs. (See Ethics)
- Productionization: online vs batch, blue/green/canary, shadow mode.
- Monitoring: data/label drift, concept drift, retraining triggers. (See Monitoring)
- Deep Learning: choosing CNN/RNN/Transformers for tasks; transfer learning.