I spoke with my friend. He said his solution was tailored to whales and to the particular data set he used. However, he told me that a paper, Toolbox For Animal Call Recognition, was very useful to him.
Here is the abstract and a method of purchasing access. You might be able to find free access another way.
This isn’t exactly going to be a catch-all for determining what hardware you need. To work that out for a project like this, it is probably worthwhile to define your problem in terms of three things: capturing data (how, how much, how often, sample rate, storage method), the algorithms you will use (expected library support, speed, RAM needed), and connectivity (how you are going to get the data out).
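For the "how much data" part, a quick back-of-the-envelope calculation can help. Here is a rough sketch in Python. The numbers (44.1 kHz, 16-bit, mono, one hour of recording per day) are just assumptions I picked for illustration, and the function names are made up, so plug in whatever your microphone and recording schedule actually look like.

```python
def audio_data_rate(sample_rate_hz, bits_per_sample, channels=1):
    """Raw (uncompressed PCM) data rate in bytes per second."""
    return sample_rate_hz * bits_per_sample * channels // 8


def storage_needed(bytes_per_second, seconds_per_day, days):
    """Total bytes written when recording seconds_per_day for the given number of days."""
    return bytes_per_second * seconds_per_day * days


# Assumed example values, not taken from any real setup.
rate = audio_data_rate(sample_rate_hz=44_100, bits_per_sample=16)
per_day = storage_needed(rate, seconds_per_day=3600, days=1)

print(f"Data rate: {rate} bytes/s")
print(f"One hour of recording per day: {per_day / 1e6:.1f} MB/day")
```

The data rate tells you what the board has to keep up with in real time, and the per-day figure tells you roughly how big the SD card (or how fast the link for getting data out) needs to be.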
If you can get a proof of concept running in the full .NET, that might be a good indicator that a product on this site would work as well.
P.S. I’m not very experienced, so if my advice seems off, that’s probably because I’m spitballing.