On the surface this seems like it's trading one problem for another. When users are on VPN we don't want to overload those devices or connections but when users are on-prem we don't want to overload the internet circuit. If I understand this article and the corresponding "updates for remote workers" article correctly, you're removing the ability to use Distribution Points to optimize internet traffic from sites with numerous clients. Is there a single approach that optimizes both scenarios?
For example, does this still honor Peer-Caching? Could that be used on-prem so that it would get pulled "once" from the CDN but then shared with peers? It would be nice to see a comprehensive guide to bandwidth optimization that accounts for multiple scenarios rather than focusing on one scenario (potentially) at the cost of others.