I’ve noticed that CATap appears to scale the stereo/mono mixdown based on the number of channels in the output audio device. For example, if the selected output device has 4 channels, the tapped audio is reduced by half. This behavior does not seem to be documented anywhere, but I’m assuming it may have been done to reduce the chance of clipping when downmixing. The issue is that users are complaining that the audio level is too low. Is there any workaround for this that does not involve tapping every audio device separately and doing my own mixdown? Ideally, I’d like to keep using the system tap/mix behavior but avoid the automatic level reduction, or at least compensate for it in a reliable way.
This is an issue we're aware of with stereo and mixdown taps when capturing from devices with more than two channels: FB13479345
Unfortunately we do not currently have a workaround beyond doing your own mixdown from a device format tap.
Even though we're aware of this issue, we still encourage you to open a bug report, and post the FB number here once you do. The specific info you include your bug report might help our investigation, and filing the bug report allows you to get notified when it is resolved.