KV Cache Compression Will Cut Inference Costs 50% Before HBM4 Ships at Scale | Market Calls | MenFem