Commit b7947c85 authored by Guolin Ke's avatar Guolin Ke Committed by GitHub
Browse files

Update Python_intro.md

parent b1e34d15
......@@ -30,9 +30,9 @@ The data is stored in a ```Dataset``` object.
#### To load a libsvm text file or a LightGBM binary file into ```Dataset```:
```python
train_data = lgb.Dataset('train.svm')
test_data = lgb.Dataset('test.svm.bin')
train_data = lgb.Dataset('train.svm.bin')
```
#### To load a numpy array into ```Dataset```:
```python
data = np.random.rand(500,10) # 500 entities, each contains 10 features
......@@ -49,6 +49,18 @@ train_data = lgb.Dataset(csr)
train_data = lgb.Dataset('train.svm.txt')
train_data.save_binary("train.bin")
```
#### Create validation data
```python
test_data = train_data.create_valid('test.svm')
```
or
```python
test_data = lgb.Dataset('test.svm', reference=train_data)
```
In LightGBM, the validation data should be aligned with training data.
#### Specific feature names and categorical features
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment