Github sources
When chess movegen performance matters. C and magic bitboards.
But it is not GPU cuda tensors based, nor TPUs. Actually, depends on usage and tasks. Avoid python based position processing anyway. Just there is a way to write something in C too.
In python:
import chelpy
chelpy.ucimove("e2e4")
chelpy.movegen()
...
For most usual cases please use the original chess library , already included in python. It is much more developed and supported. Also includes functionality for endgame tables lookups. Community and well documented sources. Especially using endgame tablebases (wiki), they are various kinds. Sources are python PyPI scripts. Kinda fast and windows or mac compatible.