Chinese README: cnREADME.md
In the summer of 2025, I restart the development of PyDyNet after two years. PyDyNet implemented a pure inference version of Llama3 (6-layer Transformer, vocab-size=32000). The implementation is inspired by the NumPy version and dataset available here. To run it, download the dataset into the llm/llama folder and execute:
>>> python -m llm.llama.infer
There was a boy named Timmy. He loved to play with hi toy and run around outside. One day, Timmy' mom asked him to help her with the laundry. Timmy didn't want to help because he wanted to play. But hi mom said, "Timmy, you need to help me. It' important to help out."
Timmy didn't want to help, but he knew he had to. So, he put on hi shoe and went outside to help hi mom. A they were folding the clothe, Timmy saw a big pile of laundry on the floor. He wanted to help, so he started to pick it up. But then, he accidentally knocked over a pile of clothe and they fell on him. Timmy wa okay, but he felt bad.
Hi mom saw what happened and said, "Timmy, you need to be more careful. You could have hurt yourself." Timmy felt bad and said sorry. Hi mom hugged him and said, "It' okay, accident happen. Let' clean up the laundry together." Timmy learned that it' important to be careful and help out when you need it.
Token count: 262, elapsed: 0.87s, 300 tokens/sWe also implemented a pure inference version of CLIP, inspired by the NumPy version and dataset available NPCLIP. To run it, imigrate data folder of MPCLIP into llm/clip folder and execute:
>>> python -m llm.clip.infer
Label probs: [0.000953   0.48176003 0.51728696]for the following image and query ["a fish", "a dog", "a cat"]
PyDyNet is a neural network framework implemented entirely in NumPy (with CuPy support since version 0.0.7, using the same API). Its syntax is inspired by PyTorch, and its structure is as follows:
graph LR
   N(numpy/cupy.ndarray)--Backend--> A(Tensor) --> ds(Dataset) ---> Data(DataLoader)---> Mission
   A  --Eager execution--> B(Basic operators:<br> add, exp, etc)
   B -.Autograd-.-> A
   B --> CO(Complex<br>operators)
   --> f(Function:<br>img2col, etc) 
   --> M(Basic Module:<br>Linear, etc)
   --> CM(Advanced Module: CNN, RNN, Transformer, etc)
   --> Mission(Learning task)
   A --> GD(Optimizer:<br> SGD, Adam, etc) ---> LS(lr_scheduler: <br>StepLR, etc)---> Mission
    Dashed lines indicate that users can disable automatic differentiation using no_grad.
Just
pip install pydynetor
git clone https://github.com/Kaslanarian/PyDyNet
cd PyDyNet
python setup.py installExamples can be found in the examples/pydynet directory, with equivalent PyTorch implementations in examples/pytorch. To run an example, use:
python -m examples.pydynet.xxxThe example autodiff1d.py demonstrates automatic differentiation by performing gradient descent on a one-dimensional convex function:
A multi-variable convex function example is provided in autodiff2d.py:
The example mlp_cnn.py uses MLP and LeNet to classify MNIST digits. The training and testing accuracies are shown below:
The example mlp_dropout_bn.py compares the performance of three networks on the fetch_olivetti_faces dataset (64×64 pixel images):
- Three-layer MLP;
- Three-layer MLP with Dropout;
- Three-layer MLP with Batch Normalization.
The example ts_prediction.py demonstrates time series prediction using a GRU:
The example transformer.py shows how to train a text classification model using a Transformer. The training results are as follows:
Dataset (CoLA) link: https://nyu-mll.github.io/CoLA/cola_public_1.1.zip
PyDyNet supports CUDA acceleration through CuPy. To use it, simply install CuPy and use the same API as NumPy. We compare the performance of PyDyNet with CuPy and NumPy as follows on Nvidia GeForce RTX 4090:
| Network structure | Dataset | CPU time (s) per epoch | GPU time (s) per epoch | 
|---|---|---|---|
| 3-layer MLP | MNIST (80000×574) | 7.256±0.138 | 1.203±.0181 | 
| LeNet | MNIST (80000×574) | 239.664±2.108 | 2.841±0.026 | 
| 1-layer Transformer (dim=512, head=4) | CoLA (8551×45×64) | 17.503±0.251 | 1.075±0.002 | 






