"python-package/vscode:/vscode.git/clone" did not exist on "cf0a992eda609cce45d032c5b737f74949fdb9d3"
Commit b7947c85 authored by Guolin Ke, committed by GitHub

Update Python_intro.md

parent b1e34d15
@@ -30,9 +30,9 @@ The data is stored in a ```Dataset``` object.
#### To load a libsvm text file or a LightGBM binary file into ```Dataset```:
```python
-train_data = lgb.Dataset('train.svm')
+train_data = lgb.Dataset('train.svm.bin')
test_data = lgb.Dataset('test.svm.bin')
```
#### To load a numpy array into ```Dataset```:
```python
data = np.random.rand(500,10) # 500 entities, each contains 10 features
@@ -49,6 +49,18 @@ train_data = lgb.Dataset(csr)
train_data = lgb.Dataset('train.svm.txt')
train_data.save_binary("train.bin")
```
+#### Create validation data
+```python
+test_data = train_data.create_valid('test.svm')
+```
+or
+```python
+test_data = lgb.Dataset('test.svm', reference=train_data)
+```
+In LightGBM, the validation data should be aligned with training data.
#### Specific feature names and categorical features
......
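
The added note that validation data should be aligned with training data can be illustrated with a small sketch. It assumes that "aligned" means the validation features are discretized with the bin boundaries learned from the training data, which is what passing `reference=train_data` is understood to arrange. The helpers `fit_bin_edges` and `to_bins` are hypothetical names written for this illustration in plain Python; they are not part of the LightGBM API and do not reproduce its actual binning algorithm.

```python
from bisect import bisect_right


def fit_bin_edges(values, n_bins):
    """Pick up to n_bins - 1 cut points from the sorted training values
    (a simple equal-frequency rule, used here for illustration only)."""
    ordered = sorted(values)
    edges = []
    for i in range(1, n_bins):
        cut = ordered[i * len(ordered) // n_bins]
        # Skip repeated cut points so the edges stay strictly increasing.
        if not edges or cut > edges[-1]:
            edges.append(cut)
    return edges


def to_bins(values, edges):
    """Map each raw value to the index of the bin it falls into."""
    return [bisect_right(edges, v) for v in values]


train_feature = [0.1, 0.4, 0.5, 0.9, 1.3, 2.0, 2.2, 3.5]
valid_feature = [0.3, 1.0, 4.0]

# Bin edges are learned once, from the training data only.
edges = fit_bin_edges(train_feature, n_bins=4)

train_bins = to_bins(train_feature, edges)
# The validation data reuses the training edges, so bin index k
# means the same value range in both datasets.
valid_bins = to_bins(valid_feature, edges)

print(edges)       # [0.5, 1.3, 2.2]
print(train_bins)  # [0, 0, 1, 1, 2, 2, 3, 3]
print(valid_bins)  # [0, 1, 3]
```

If the validation set computed its own edges instead, the same bin index could refer to different value ranges in the two datasets, and a model trained on one could not be evaluated meaningfully on the other.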