HRM Generalization Test

Just testing if HRM Generalizes better compared to others in Vision Task, here ViT based architecture is used. Patchify then flatten the image and then pass it to HRM.

Here I made my own HRM layer based on the paper. Using Recurrent mechanism to model the High level and low level cycles.
And the Initial results are promising. The smaller HRM model beats the larger Resnet model. On Mnist dataset with 1000 balanced samples. And when me and my Arjun tested on it with more epochs, HRM didn't seeminly overfit.
Recent findings by us show that HRM is not only good at generalization but also in robustness.

Can this be a new type of architecture that can change the world of Deep Learning?

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.gitattributes		.gitattributes
.gitignore		.gitignore
HRMVision_metrics.json		HRMVision_metrics.json
README.md		README.md
Resnet_metrics.json		Resnet_metrics.json
hrm_testing.ipynb		hrm_testing.ipynb
main.ipynb		main.ipynb
tinyshakespear.txt		tinyshakespear.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

HRM Generalization Test

About

Uh oh!

Releases

Packages

Languages

Rohit909-creator/Hierarchical_Reasoning_Model

Folders and files

Latest commit

History

Repository files navigation

HRM Generalization Test

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages