paddlenlp.datasets.dataset.po 9.06 KB
Newer Older
yuguo-Jack's avatar
yuguo-Jack committed
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
# SOME DESCRIPTIVE TITLE.
# Copyright (C) 2021, PaddleNLP
# This file is distributed under the same license as the PaddleNLP package.
# FIRST AUTHOR <EMAIL@ADDRESS>, 2022.
#
#, fuzzy
msgid ""
msgstr ""
"Project-Id-Version: PaddleNLP \n"
"Report-Msgid-Bugs-To: \n"
"POT-Creation-Date: 2022-03-18 21:31+0800\n"
"PO-Revision-Date: YEAR-MO-DA HO:MI+ZONE\n"
"Last-Translator: FULL NAME <EMAIL@ADDRESS>\n"
"Language-Team: LANGUAGE <LL@li.org>\n"
"MIME-Version: 1.0\n"
"Content-Type: text/plain; charset=utf-8\n"
"Content-Transfer-Encoding: 8bit\n"
"Generated-By: Babel 2.9.0\n"

#: ../source/paddlenlp.datasets.dataset.rst:2
msgid "dataset"
msgstr ""

#: of paddlenlp.datasets.dataset.MapDataset:1
msgid ""
"Wraps a map-style dataset-like object as an instance of `MapDataset`, and"
" equips it with `map` and other utility methods. All non-magic methods of"
" the raw object are also accessible."
msgstr ""

#: of paddlenlp.datasets.dataset.DatasetBuilder.read
#: paddlenlp.datasets.dataset.IterDataset
#: paddlenlp.datasets.dataset.IterDataset.filter
#: paddlenlp.datasets.dataset.IterDataset.map
#: paddlenlp.datasets.dataset.IterDataset.shard
#: paddlenlp.datasets.dataset.MapDataset
#: paddlenlp.datasets.dataset.MapDataset.filter
#: paddlenlp.datasets.dataset.MapDataset.map
#: paddlenlp.datasets.dataset.load_dataset
msgid "参数"
msgstr ""

#: of paddlenlp.datasets.dataset.MapDataset:5
msgid ""
"An object with `__getitem__` and `__len__` methods. It could be a list or"
" a subclass of `paddle.io.Dataset`."
msgstr ""

#: of paddlenlp.datasets.dataset.IterDataset:8
#: paddlenlp.datasets.dataset.MapDataset:8
msgid "Other information to be passed to the dataset."
msgstr ""

#: of paddlenlp.datasets.dataset.IterDataset:11
#: paddlenlp.datasets.dataset.MapDataset:11
msgid ""
"For examples of this class, please see `dataset_self_defined "
"<https://paddlenlp.readthedocs.io/zh/latest/data_prepare/dataset_self_defined.html>`__."
msgstr ""

#: of paddlenlp.datasets.dataset.IterDataset.filter:1
#: paddlenlp.datasets.dataset.MapDataset.filter:1
msgid ""
"Filters samples by the filter function and uses the filtered data to "
"update this dataset."
msgstr ""

#: of paddlenlp.datasets.dataset.MapDataset.filter:4
msgid ""
"A filter function that takes a sample as input and returns a boolean. "
"Samples that return False would be discarded."
msgstr ""

#: of paddlenlp.datasets.dataset.MapDataset.filter:7
msgid ""
"Number of processes for multiprocessing. If set to 0, it doesn't use "
"multiprocessing. Defaults to `0`."
msgstr ""

#: of paddlenlp.datasets.dataset.IterDataset.map:1
#: paddlenlp.datasets.dataset.MapDataset.map:1
msgid ""
"Performs specific function on the dataset to transform and update every "
"sample."
msgstr ""

#: of paddlenlp.datasets.dataset.MapDataset.map:3
msgid ""
"Transformations to be performed. It receives single sample as argument if"
" batched is False. Else it receives all examples."
msgstr ""

#: of paddlenlp.datasets.dataset.MapDataset.map:6
msgid ""
"If True, transformations would be delayed and performed on demand. "
"Otherwise, transforms all samples at once. Note that if `fn` is "
"stochastic, `lazy` should be True or you will get the same result on all "
"epochs. Defaults to False."
msgstr ""

#: of paddlenlp.datasets.dataset.MapDataset.map:11
msgid ""
"If True, transformations would take all examples as input and return a "
"collection of transformed examples. Note that if set True, `lazy` option "
"would be ignored. Defaults to False."
msgstr ""

#: of paddlenlp.datasets.dataset.MapDataset.map:15
msgid ""
"Number of processes for multiprocessing. If set to 0, it doesn't use "
"multiprocessing. Note that if set to positive value, `lazy` option would "
"be ignored. Defaults to 0."
msgstr ""

#: of paddlenlp.datasets.dataset.DatasetBuilder:1
msgid ""
"A base class for all DatasetBuilder. It provides a `read()` function to "
"turn a data file into a MapDataset or IterDataset."
msgstr ""

#: of paddlenlp.datasets.dataset.DatasetBuilder:4
msgid ""
"`_get_data()` function and `_read()` function should be implemented to "
"download data file and read data file into a `Iterable` of the examples."
msgstr ""

#: of paddlenlp.datasets.dataset.DatasetBuilder:7
msgid ""
"For how to define a custom `DatasetBuilder`, please see "
"`contribute_dataset "
"<https://paddlenlp.readthedocs.io/zh/latest/community/contribute_dataset.html>`__."
msgstr ""

#: of paddlenlp.datasets.dataset.DatasetBuilder.read:1
msgid ""
"Returns a dataset containing all the examples that can be read from the "
"file path."
msgstr ""

#: of paddlenlp.datasets.dataset.DatasetBuilder.read:3
msgid ""
"If `self.lazy` is False, this eagerly reads all instances from "
"`self._read()` and returns a `MapDataset`."
msgstr ""

#: of paddlenlp.datasets.dataset.DatasetBuilder.read:6
msgid ""
"If `self.lazy` is True, this returns an `IterDataset`, which internally "
"relies on the generator created from `self._read()` to lazily produce "
"examples. In this case your implementation of `_read()` must also be lazy"
" (that is, not load all examples into memory at once)."
msgstr ""

#: of paddlenlp.datasets.dataset.DatasetBuilder.read:11
msgid "Path of data file to read, usually provided by `_get_data` function."
msgstr ""

#: of paddlenlp.datasets.dataset.DatasetBuilder.read:14
msgid ""
"The split name of selected dataset. This only makes a different when data"
" files of different splits have different structures."
msgstr ""

#: of paddlenlp.datasets.dataset.DatasetBuilder.read
#: paddlenlp.datasets.dataset.load_dataset
msgid "返回"
msgstr ""

#: of paddlenlp.datasets.dataset.DatasetBuilder.read:18
msgid "A `MapDataset|IterDataset`."
msgstr ""

#: of paddlenlp.datasets.dataset.DatasetBuilder.get_labels:1
msgid "Returns list of class labels of the dataset if specified."
msgstr ""

#: of paddlenlp.datasets.dataset.DatasetBuilder.get_vocab:1
msgid "Returns vocab file path of the dataset if specified."
msgstr ""

#: of paddlenlp.datasets.dataset.IterDataset:1
msgid ""
"Wraps a dataset-like object as an instance of `IterDataset`, and equips "
"it with `map` and other utility methods. All non-magic methods of the raw"
" object also accessible."
msgstr ""

#: of paddlenlp.datasets.dataset.IterDataset:5
msgid ""
"An object with `__iter__` function. It can be a Iterable or a subclass of"
" `paddle.io.IterableDataset`."
msgstr ""

#: of paddlenlp.datasets.dataset.IterDataset.filter:4
msgid ""
"A filter function that takes a sample as input and returns a boolean. "
"Samples that return False are discarded."
msgstr ""

#: of paddlenlp.datasets.dataset.IterDataset.shard:1
msgid "Split the dataset into `num_shards` pieces."
msgstr ""

#: of paddlenlp.datasets.dataset.IterDataset.shard:3
msgid ""
"An integer representing the number of data shards. If None, `num_shards` "
"would be number of trainers. Defaults to None."
msgstr ""

#: of paddlenlp.datasets.dataset.IterDataset.shard:7
msgid ""
"An integer representing the index of the current shard. If None, `index` "
"would be the current trainer rank id. Defaults to None."
msgstr ""

#: of paddlenlp.datasets.dataset.IterDataset.map:3
msgid "Transformations to be performed. It receives single sample as argument."
msgstr ""

#: of paddlenlp.datasets.dataset.load_dataset:1
msgid ""
"This method will load a dataset, either form PaddleNLP library or from a "
"self-defined data loading script, by calling functions in "
"`DatasetBuilder`."
msgstr ""

#: of paddlenlp.datasets.dataset.load_dataset:4
msgid ""
"For all the names of datasets in PaddleNLP library, see here:  "
"`dataset_list "
"<https://paddlenlp.readthedocs.io/zh/latest/data_prepare/dataset_list.html>`__."
msgstr ""

#: of paddlenlp.datasets.dataset.load_dataset:7
msgid "Either `splits` or `data_files` must be specified."
msgstr ""

#: of paddlenlp.datasets.dataset.load_dataset:9
msgid ""
"Name of the dataset processing script in PaddleNLP library or a custom "
"data reading function."
msgstr ""

#: of paddlenlp.datasets.dataset.load_dataset:12
msgid "Additional name to select a more specific dataset. Defaults to None."
msgstr ""

#: of paddlenlp.datasets.dataset.load_dataset:15
msgid ""
"Defining the path of dataset files. If None. `splits` must be specified. "
"Defaults to None."
msgstr ""

#: of paddlenlp.datasets.dataset.load_dataset:18
msgid ""
"Which split of the data to load. If None. `data_files` must be specified."
" Defaults to None."
msgstr ""

#: of paddlenlp.datasets.dataset.load_dataset:21
msgid ""
"Weather to return `MapDataset` or an `IterDataset`. True for "
"`IterDataset`. False for `MapDataset`. If None, return the default type "
"of this dataset. Defaults to None."
msgstr ""

#: of paddlenlp.datasets.dataset.load_dataset:25
msgid "Other keyword arguments to be passed to the `DatasetBuilder`."
msgstr ""

#: of paddlenlp.datasets.dataset.load_dataset:28
msgid "A `MapDataset` or `IterDataset` or a tuple of those."
msgstr ""

#: of paddlenlp.datasets.dataset.load_dataset:30
msgid ""
"For how to use this function, please see `dataset_load "
"<https://paddlenlp.readthedocs.io/zh/latest/data_prepare/dataset_load.html>`__"
" and `dataset_self_defined "
"<https://paddlenlp.readthedocs.io/zh/latest/data_prepare/dataset_self_defined.html>`__"
msgstr ""