Function mindspore::dataset::MindData

Defined in File datasets.h

Function Documentation

inline std::shared_ptr<MindDataDataset> mindspore::dataset::MindData(const std::string &dataset_file, const std::vector<std::string> &columns_list, const std::reference_wrapper<Sampler> sampler, nlohmann::json *padded_sample = nullptr, int64_t num_padded = 0, ShuffleMode shuffle_mode = ShuffleMode::kGlobal, const std::shared_ptr<DatasetCache> &cache = nullptr)

Function to create a MindDataDataset.

Parameters

dataset_file – [in] File name of one component of a mindrecord source. Other files with identical source in the same path will be found and loaded automatically.
columns_list – [in] List of columns to be read.
sampler – [in] Sampler object used to choose samples from the dataset. supported sampler list: SubsetRandomSampler, PkSampler, RandomSampler, SequentialSampler, DistributedSampler.
padded_sample – [in] Samples will be appended to dataset, where keys are the same as column_list.
num_padded – [in] Number of padding samples. Dataset size plus num_padded should be divisible by num_shards.
shuffle_mode – [in] The mode for shuffling data every epoch (Default=ShuffleMode::kGlobal). Can be any of: ShuffleMode::kFalse - No shuffling is performed. ShuffleMode::kFiles - Shuffle files only. ShuffleMode::kGlobal - Shuffle both the files and samples. ShuffleMode::kInfile - Shuffle samples in file.
cache – [in] Tensor cache to use (default=nullptr which means no cache is used).

Returns

Shared pointer to the MindDataDataset.