You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
If the dataset has pickle files, you need to pay attention to the following security risks.
123
+
124
+
- Loading Pickle files will lead to the following risks:
125
+
- Remote Code Execution (RCE)
126
+
- Sensitive data leakage
127
+
- System compromise
128
+
- By using pickle files, you acknowledge the risks and agree to:
129
+
- Covert pickle files in isolated environments (e.g., sandbox/container)
130
+
- Never load `.pkl` files from untrusted sources
131
+
132
+
For more information, review the [Documentation](https://docs.python.org/3/library/pickle.html) for the ``pickle`` module.
133
+
134
+
Taking the following training dataset as an example:
135
+
```shell
136
+
unzip training_examples.zip && tree training_examples
137
+
```
138
+
```text
139
+
training_examples
140
+
├─input
141
+
│ └──uid_0
142
+
│ ├─000.png
143
+
│ └──meta.pkl
144
+
├─target
145
+
│ └──uid_0
146
+
│ └──000.png
147
+
└──uid_set.pkl
148
+
```
149
+
The data type stored in ``meta.pkl`` is `List[np.ndarray]`, and the data type stored in ``uid_set.pkl`` is `List[str]`.
150
+
We recommend that you convert the ``meta.pkl`` file to ``meta.npz`` format and convert the ``uid_set.pkl`` file to ``uid_set.json`` format by yourself before training.
151
+
121
152
### Data Curation
122
153
Following the original paper, we used Blender to render multiview frames for a 3D object in `.obj` for training. Typically for overfitting, three 3D objects from the objaverse dataset are used. We rendered 5 arbitral views for each object with the corresponding camera parameters extracted.
0 commit comments