"test/srt/test_deepseek_v32_mtp.py" did not exist on "48473684cc3e3d080fca85b089375700788f2d7a"
-
kahmed10 authored
This PR allows for other values of epsilon to be matched when finding layernorm. Similarly, the calculation now uses the variable for epsilon.
d9578ba6