Added custom FMHA codegen receipt for TransformerEngine
(#6867)
## Motivation
TE uses AITER to build static MHA libraries, which ultimately rely on CK
kernels. We currently use the `600` receipt, which generates more kernels
than TE actually needs. This bespoke receipt lets us minimize the kernel
count, compile time, and memory footprint of our MHA library.
## Technical Details
Extended the receipt mechanism to include a custom `700` receipt tailored
to TE's needs.
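
As a rough illustration of how a receipt can narrow the generated kernel set, here is a minimal, self-contained sketch. It is not the actual CK codegen: `KernelSpec`, `receipt_filter`, `select_kernels`, and the conditions attached to each receipt are hypothetical stand-ins, and the real `700` criteria are defined by the patch itself.

```python
from dataclasses import dataclass
from itertools import product


# Hypothetical kernel descriptor; the field names are illustrative only
# and do not mirror the real codegen's data structures.
@dataclass(frozen=True)
class KernelSpec:
    dtype: str
    hdim: int
    has_bias: bool


def receipt_filter(receipt: int, spec: KernelSpec) -> bool:
    """Return True if `spec` should be generated under `receipt`."""
    if receipt == 600:
        # Assumption: the broad profile keeps every candidate.
        return True
    if receipt == 700:
        # Assumption: placeholder conditions standing in for TE's real needs.
        return spec.dtype in ("fp16", "bf16") and not spec.has_bias
    raise ValueError(f"unknown receipt: {receipt}")


def select_kernels(receipt: int, candidates: list[KernelSpec]) -> list[KernelSpec]:
    """Keep only the candidates that the given receipt allows."""
    return [spec for spec in candidates if receipt_filter(receipt, spec)]


# Toy candidate space: 3 dtypes x 2 head dims x 2 bias settings.
candidates = [
    KernelSpec(dtype, hdim, has_bias)
    for dtype, hdim, has_bias in product(
        ("fp16", "bf16", "fp8"), (64, 128), (False, True)
    )
]

broad = select_kernels(600, candidates)
narrow = select_kernels(700, candidates)
print(f"receipt 600 keeps {len(broad)} kernels, receipt 700 keeps {len(narrow)}")
```

A narrower receipt is just a stricter predicate over the same candidate space, which is why it reduces kernel count and compile time without touching the kernels themselves.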
## Test Plan
Test by building TE using the same `700` receipt profile.
## Test Result
Build validated in TE using custom feature branches of AITER/CK to
temporarily apply the patch.
## Submission Checklist
- [ ] Look over the contributing guidelines at
https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.