I don't think you can mix cluster across layers. Within a layer no problem, but across two (or more) seems to go against the intent of the feature.
Out of curiosity, what leads you to having two layers? Is it not possible to combine them into one - and drive the distinction by something like color or icons or what not?