easy_rec.python.input¶
easy_rec.python.input.input¶
- class easy_rec.python.input.input.Input(data_config, feature_configs, input_path, task_index=0, task_num=1, check_mode=False, pipeline_config=None, **kwargs)[source]¶
Bases:
object
- DATA_OFFSET = 'DATA_OFFSET'¶
- __init__(data_config, feature_configs, input_path, task_index=0, task_num=1, check_mode=False, pipeline_config=None, **kwargs)[source]¶
- property num_epochs¶
- create_multi_placeholders(export_config)[source]¶
Create multiply placeholders on export, one for each feature.
- Parameters:
export_config – ExportConfig instance.
- classmethod create_class(name)¶
easy_rec.python.input.csv_input¶
- class easy_rec.python.input.csv_input.CSVInput(data_config, feature_config, input_path, task_index=0, task_num=1, check_mode=False, pipeline_config=None)[source]¶
Bases:
Input
- __init__(data_config, feature_config, input_path, task_index=0, task_num=1, check_mode=False, pipeline_config=None)[source]¶
- classmethod create_class(name)¶
easy_rec.python.input.csv_input_v2¶
- class easy_rec.python.input.csv_input_v2.CSVInputV2(data_config, feature_config, input_path, task_index=0, task_num=1, check_mode=False, pipeline_config=None)[source]¶
Bases:
Input
- __init__(data_config, feature_config, input_path, task_index=0, task_num=1, check_mode=False, pipeline_config=None)[source]¶
- classmethod create_class(name)¶
easy_rec.python.input.kafka_input¶
- class easy_rec.python.input.kafka_input.KafkaInput(data_config, feature_config, kafka_config, task_index=0, task_num=1, check_mode=False, pipeline_config=None)[source]¶
Bases:
Input
- DATA_OFFSET = 'DATA_OFFSET'¶
- __init__(data_config, feature_config, kafka_config, task_index=0, task_num=1, check_mode=False, pipeline_config=None)[source]¶
- classmethod create_class(name)¶
easy_rec.python.input.odps_input¶
- class easy_rec.python.input.odps_input.OdpsInput(data_config, feature_config, input_path, task_index=0, task_num=1, check_mode=False, pipeline_config=None)[source]¶
Bases:
Input
- __init__(data_config, feature_config, input_path, task_index=0, task_num=1, check_mode=False, pipeline_config=None)[source]¶
- classmethod create_class(name)¶
easy_rec.python.input.odps_input_v2¶
- class easy_rec.python.input.odps_input_v2.OdpsInputV2(data_config, feature_config, input_path, task_index=0, task_num=1, check_mode=False, pipeline_config=None)[source]¶
Bases:
Input
- __init__(data_config, feature_config, input_path, task_index=0, task_num=1, check_mode=False, pipeline_config=None)[source]¶
- classmethod create_class(name)¶
easy_rec.python.input.odps_rtp_input¶
- class easy_rec.python.input.odps_rtp_input.OdpsRTPInput(data_config, feature_config, input_path, task_index=0, task_num=1, check_mode=False, pipeline_config=None)[source]¶
Bases:
Input
RTPInput for parsing rtp fg new input format on odps.
- Our new format(csv in table) of rtp output:
label0, item_id, …, user_id, features
- For the feature column, features are separated by ,
multiple values of one feature are separated by , such as: …20beautysmartParis…
- The features column and labels are specified by data_config.selected_cols,
columns are selected by names in the table such as: clk,features, the last selected column is features, the first selected columns are labels
- __init__(data_config, feature_config, input_path, task_index=0, task_num=1, check_mode=False, pipeline_config=None)[source]¶
- classmethod create_class(name)¶
easy_rec.python.input.rtp_input¶
- class easy_rec.python.input.rtp_input.RTPInput(data_config, feature_config, input_path, task_index=0, task_num=1, check_mode=False, pipeline_config=None)[source]¶
Bases:
Input
RTPInput for parsing rtp fg new input format.
- Our new format(csv in csv) of rtp output:
label0, item_id, …, user_id, features
here the separator(,) could be specified by data_config.rtp_separator For the feature column, features are separated by ,
multiple values of one feature are separated by , such as: …20beautysmartParis…
- The features column and labels are specified by data_config.selected_cols,
columns are selected by indices as our csv file has no header, such as: 0,1,4, means the 4th column is features, the 1st and 2nd columns are labels
- __init__(data_config, feature_config, input_path, task_index=0, task_num=1, check_mode=False, pipeline_config=None)[source]¶
- classmethod create_class(name)¶
easy_rec.python.input.rtp_input_v2¶
- class easy_rec.python.input.rtp_input_v2.RTPInputV2(data_config, feature_config, input_path, task_index=0, task_num=1, check_mode=False, pipeline_config=None)[source]¶
Bases:
Input
RTPInput for parsing rtp fg input format.
the original rtp format, it is not efficient for training, the performance have to be tuned.
- __init__(data_config, feature_config, input_path, task_index=0, task_num=1, check_mode=False, pipeline_config=None)[source]¶
- classmethod create_class(name)¶