
Feature processor v3 #5565

Draft
BassemHalim wants to merge 13 commits into aws:master from BassemHalim:feature-processor-v3

BassemHalim commented Feb 20, 2026

(This PR is a continuation of PR 5490.)

Port feature_processor module to SageMaker Python SDK v3

Migrates the feature_store.feature_processor module from sagemaker-python-sdk v2 to the v3 modular package structure (sagemaker-mlops), along with supporting changes in sagemaker-core.

Changes

Import path migration: all internal imports updated from sagemaker.feature_store.feature_processor → sagemaker.mlops.feature_store.feature_processor, and external SDK dependencies remapped to their v3 locations:

  • sagemaker.Session → sagemaker.core.helper.session_helper.Session
  • sagemaker.lineage → sagemaker.core.lineage
  • sagemaker.remote_function → sagemaker.core.remote_function
  • sagemaker.workflow → sagemaker.mlops.workflow
  • sagemaker.s3 / sagemaker.utils / sagemaker.vpc_utils → sagemaker.core.*
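
To make the remapping concrete, here is a minimal sketch of a prefix-based rewrite for dotted import paths. It is not code from this PR: `IMPORT_MAP` and `remap_import` are hypothetical names, and the table only contains the mappings whose full target is spelled out above. The `sagemaker.s3` / `sagemaker.utils` / `sagemaker.vpc_utils` entries are left out because their exact `sagemaker.core.*` targets are not given.

```python
# Hypothetical helper, not part of the PR: old-to-new module prefixes
# copied from the list above (only entries with a fully stated target).
IMPORT_MAP = {
    "sagemaker.feature_store.feature_processor": "sagemaker.mlops.feature_store.feature_processor",
    "sagemaker.Session": "sagemaker.core.helper.session_helper.Session",
    "sagemaker.lineage": "sagemaker.core.lineage",
    "sagemaker.remote_function": "sagemaker.core.remote_function",
    "sagemaker.workflow": "sagemaker.mlops.workflow",
}


def remap_import(dotted_path: str) -> str:
    """Return the v3 location for a v2 dotted path, or the path unchanged."""
    # Try longer prefixes first so the most specific rule wins.
    for old in sorted(IMPORT_MAP, key=len, reverse=True):
        # Match whole path components only, so "sagemaker.workflows"
        # is not treated as "sagemaker.workflow" plus a suffix.
        if dotted_path == old or dotted_path.startswith(old + "."):
            return IMPORT_MAP[old] + dotted_path[len(old):]
    return dotted_path
```

Paths with no entry in the table, including the three whose targets are elided, come back unchanged, so a caller can tell which imports still need a manual decision.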

Estimator → ModelTrainer migration (feature_scheduler.py): replaced the v2 dict-based Estimator construction with the v3 ModelTrainer API.

  • Uses Compute, Networking, StoppingCondition, SourceCode, OutputDataConfig, and Tag config objects
  • Creates a PipelineSession for pipeline-aware execution
  • TrainingStep now takes step_args from ModelTrainer.train() instead of an estimator and inputs

Input channel format change (_config_uploader.py): prepare_step_input now returns List[Channel], built from the Channel / DataSource / S3DataSource shapes, instead of Dict[str, TrainingInput].
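
The shape of that change can be sketched as follows. This is an illustration only, not the PR's implementation: the three dataclasses are local stand-ins for the real shapes, their snake_case field names are assumptions modeled on the CreateTrainingJob API fields (ChannelName, DataSource, S3DataSource, S3Uri, S3DataType), and `to_channels` is a hypothetical helper. To keep the example self-contained, the old dict maps each channel name to a plain S3 URI string rather than a TrainingInput.

```python
from dataclasses import dataclass
from typing import Dict, List


# Stand-ins for the real shapes; field names are assumptions modeled on
# the CreateTrainingJob API (ChannelName, DataSource, S3DataSource).
@dataclass
class S3DataSource:
    s3_uri: str
    s3_data_type: str = "S3Prefix"


@dataclass
class DataSource:
    s3_data_source: S3DataSource


@dataclass
class Channel:
    channel_name: str
    data_source: DataSource


def to_channels(inputs: Dict[str, str]) -> List[Channel]:
    """Convert a {channel_name: s3_uri} mapping into a list of Channel objects."""
    return [
        Channel(
            channel_name=name,
            data_source=DataSource(s3_data_source=S3DataSource(s3_uri=uri)),
        )
        for name, uri in inputs.items()
    ]
```

With the list form, the channel name travels inside each Channel object instead of being the dictionary key, so callers that previously looked inputs up by name need to iterate or build their own index.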

Session helper additions (sagemaker-core): added Feature Store methods to Session (delete_feature_group, describe_feature_group, create_feature_group, update_feature_group) and the related config schema imports.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

Aditi2424 and others added 12 commits February 4, 2026 11:58
* feat: Add Feature Store Support to V3

* Add feature store tests

---------

Co-authored-by: adishaa <adishaa@amazon.com>
- Add feature_store_functions_report.md documenting all 63 functions
  across feature_store module (excluding feature_processor)
- Add comprehensive unit tests for get_feature_group_as_dataframe:
  * Session handling (provided, from region, from role)
  * Error cases (missing session/region, missing event_time)
  * Latest ingestion logic with event time
  * Query string manipulation and table placeholder
  * Verbose and silent logging modes
  * Kwargs passing to as_dataframe
- Add comprehensive unit tests for prepare_fg_from_dataframe_or_file:
  * DataFrame and file path input handling
  * Session/region/role configuration
  * Record ID creation and validation
  * Event ID creation with timestamp
  * Duplicate record detection
  * Column name formatting
  * CSV kwargs passing
  * Feature definition loading

4 participants
