SharePoint isn't available at our office network for "random" time frames

%3CLINGO-SUB%20id%3D%22lingo-sub-1492901%22%20slang%3D%22en-US%22%3ESharePoint%20isn't%20available%20at%20our%20office%20network%20for%20%22random%22%20time%20frames%3C%2FLINGO-SUB%3E%3CLINGO-BODY%20id%3D%22lingo-body-1492901%22%20slang%3D%22en-US%22%3E%3CP%3EWe%20have%20this%20issue%20since%20nearly%202%20years%20now%20and%20changed%20a%20lot%20in%20our%20infrastructure%2C%20got%20a%20new%20firewall%2C%20new%20switches%20and%20better%20bandwith%2C%20but%20it%20stayed%20the%20same.%3CBR%20%2F%3EWe%20noticed%20that%20those%20errors%20don't%20happen%20during%20the%20weekend%2C%20when%20no%20one%20is%20in%20the%20office%20and%20became%20less%20frequent%20during%20corona%2C%20probably%20because%20many%20workers%20are%20now%20working%20from%20home.%3CBR%20%2F%3ESites%20like%20%3CA%20href%3D%22http%3A%2F%2Fwww.office.com%22%20target%3D%22_blank%22%20rel%3D%22noopener%20noreferrer%22%3Ewww.office.com%3C%2FA%3E%20are%20still%20available%20fine%20during%20those%20fallouts%2C%20only%20our%20sharepoint%20page%20is%20not.%20The%20fallouts%20can%20be%20only%20a%20few%20seconds%20or%20even%20half%20an%20hour.%3C%2FP%3E%3CP%3EInformation%20that%20is%20probably%20relevant%3A%3C%2FP%3E%3CUL%3E%3CLI%3EWe%20use%20SharePoint%20online%2C%20not%20on-prem%3C%2FLI%3E%3CLI%3EOneDrive%20also%20doesn't%20work%20during%20the%20fallouts%3C%2FLI%3E%3CLI%3EOther%20Office%20products%20like%20MS%20Teams%20do%20work%3C%2FLI%3E%3CLI%3EThe%20SharePoint%20page%20is%20still%20available%20outside%20from%20our%20network%3C%2FLI%3E%3CLI%3EI%20can%20still%20ping%20and%20use%20trcrt%20with%20the%20IP%20during%20the%20fallouts%2C%20but%20can't%20reach%20it%20with%20telnet%3C%2FLI%3E%3CLI%3EChanging%20the%20DNS%20from%20the%20router%20didn't%20work%3C%2FLI%3E%3CLI%3EDuring%20the%20fallouts%2C%20we%20noticed%20long%20TCP%20Keep-Alive%20and%20Retransmission%20sequences%20while%20monitoring%20with%20wireshark%3C%2FLI%3E%3C%2FUL%3E%3CP%3E%26nbsp%3B%3C%2FP%3E%3CP%3EWhat%20would%20your%20approach%20be%20for%20this%20problem%3F%3C%2FP%3E%3C%2FLINGO-BODY%3E%3CLINGO-LABS%20id%3D%22lingo-labs-1492901%22%20slang%3D%22en-US%22%3E%3CLINGO-LABEL%3ESharePoint%20Online%3C%2FLINGO-LABEL%3E%3C%2FLINGO-LABS%3E%3CLINGO-SUB%20id%3D%22lingo-sub-1494321%22%20slang%3D%22en-US%22%3ERe%3A%20SharePoint%20isn't%20available%20at%20our%20office%20network%20for%20%22random%22%20time%20frames%3C%2FLINGO-SUB%3E%3CLINGO-BODY%20id%3D%22lingo-body-1494321%22%20slang%3D%22en-US%22%3E%3CP%3E%3CA%20href%3D%22https%3A%2F%2Ftechcommunity.microsoft.com%2Ft5%2Fuser%2Fviewprofilepage%2Fuser-id%2F711548%22%20target%3D%22_blank%22%3E%40YvonneP%3C%2FA%3E%26nbsp%3B%3C%2FP%3E%3CP%3E%26nbsp%3B%3C%2FP%3E%3CP%3ECheck%20the%20usage%20reports%20in%20the%20Admin%20centre%20and%20check%20anyone%20isn't%20syncing%20too%20much%20data.%26nbsp%3B%20You%20can%20sort%20by%20the%20amount%20of%20files.%26nbsp%3B%3C%2FP%3E%3CP%3E%26nbsp%3B%3C%2FP%3E%3CP%3EIs%20it%20all%20site%20collections%20that%20are%20not%20available%20or%20all%20site%20collections%20at%20company.sharepoint.com%2Fsites%2F%3F%26nbsp%3B%3C%2FP%3E%3CP%3E%26nbsp%3B%3C%2FP%3E%3CP%3EAll%20users%20on%20Windows%2010%3F%26nbsp%3B%3C%2FP%3E%3CP%3E%26nbsp%3B%3C%2FP%3E%3CP%3EWhen%20this%20happens%20on%20the%20network%20can%20you%20access%20it%20from%20another%20network%3F%3C%2FP%3E%3CP%3E%26nbsp%3B%3C%2FP%3E%3CP%3EI%20have%20had%20this%20happen%20quite%20a%20few%20times%20in%20the%20past%2C%20it%20always%20seemed%20to%20be%20a%20pattern%20where%20SharePoint%20would%20go%20down%20at%20a%20particular%20time%20for%20roughly%205%20minutes%20and%20then%20be%20slow%20for%20the%20rest%20of%20the%20day%20or%20would%20generally%20be%20slow%20in%20the%20mornings.%20In%20one%20instance%20it%20was%20related%20to%20another%20customer%20running%20some%20workloads%20in%20their%20off%20peak%20time%20but%20our%20on%20peak%20time.%20Second%20time%20around%20the%20database%20had%20a%20bug%20in%20it%20and%20had%20to%20be%20re-indexed%20or%20something%20similar%20(Microsoft%20won't%20tell%20you%20what%20is%20actually%20wrong%20once%20they%20fix%20it)%20.%3C%2FP%3E%3CP%3E%26nbsp%3B%3C%2FP%3E%3CP%3EWhat%20I%20would%20do%20in%20this%20situation%20is%20next%20time%20it%20happens%20get%20some%20fiddler%20logs%20of%20your%20attempts%20to%20access%20various%20SharePoint%20sites%20and%20then%20log%20it%20as%20a%20ticket%20with%20Microsoft.%20They%20can%20see%20what%20is%20going%20on%20in%20the%20background%2C%20I%20wouldn't%20assume%20it%20is%20your%20infrastructure.%26nbsp%3B%3C%2FP%3E%3C%2FLINGO-BODY%3E%3CLINGO-SUB%20id%3D%22lingo-sub-1494325%22%20slang%3D%22en-US%22%3ERe%3A%20SharePoint%20isn't%20available%20at%20our%20office%20network%20for%20%22random%22%20time%20frames%3C%2FLINGO-SUB%3E%3CLINGO-BODY%20id%3D%22lingo-body-1494325%22%20slang%3D%22en-US%22%3E%3CP%3E%3CA%20href%3D%22https%3A%2F%2Ftechcommunity.microsoft.com%2Ft5%2Fuser%2Fviewprofilepage%2Fuser-id%2F711548%22%20target%3D%22_blank%22%3E%40YvonneP%3C%2FA%3E%26nbsp%3B%3C%2FP%3E%3CP%3E%26nbsp%3B%3C%2FP%3E%3CUL%3E%3CLI%3EDuring%20the%20fallouts%2C%20we%20noticed%20long%20TCP%20Keep-Alive%20and%20Retransmission%20sequences%20while%20monitoring%20with%20wireshark%3C%2FLI%3E%3C%2FUL%3E%3CP%3EIs%20that%20for%20all%20sites%2Ftraffic%20or%20just%20SharePoint%3F%26nbsp%3B%20What%20does%20network%20utilisation%20look%20like%20at%20this%20time%3F%26nbsp%3B%3C%2FP%3E%3CP%3E%26nbsp%3B%3C%2FP%3E%3CP%3ECheck%20the%20Microsoft%20365%20usage%20reports%20in%20the%20admin%20centre%20and%20check%20if%20anyone%20is%20syncing%20large%20amounts%20of%20data%2Fnumber%20of%20files%20for%20SharePoint%20or%20OneDrive.%20Do%20you%20tell%20you%20users%20to%20sync%20document%20libraries%20as%20a%20default%20or%20only%20the%20files%20they%20need%3F%26nbsp%3B%3C%2FP%3E%3CP%3E%26nbsp%3B%3C%2FP%3E%3CP%3EI%20had%20a%20query%20the%20other%20day%20where%20the%20customer%20had%201%20document%20library%2C%20300GB%20and%20320%2C000%20files%20and%20all%20user%20were%20syncing%20the%20data%20and%20it%20was%20killing%20every%20laptop.%20They%20were%20doing%20this%20because%20they%20always%20used%20to%20have%20every%20file%20accessible%20in%20file%20explorer.%3C%2FP%3E%3CP%3E%26nbsp%3B%3C%2FP%3E%3CP%3E%26nbsp%3B%3C%2FP%3E%3C%2FLINGO-BODY%3E
Occasional Contributor

We have this issue since nearly 2 years now and changed a lot in our infrastructure, got a new firewall, new switches and better bandwith, but it stayed the same.
We noticed that those errors don't happen during the weekend, when no one is in the office and became less frequent during corona, probably because many workers are now working from home.
Sites like www.office.com are still available fine during those fallouts, only our sharepoint page is not. The fallouts can be only a few seconds or even half an hour.

Information that is probably relevant:

  • We use SharePoint online, not on-prem
  • OneDrive also doesn't work during the fallouts
  • Other Office products like MS Teams do work
  • The SharePoint page is still available outside from our network
  • I can still ping and use trcrt with the IP during the fallouts, but can't reach it with telnet
  • Changing the DNS from the router didn't work
  • During the fallouts, we noticed long TCP Keep-Alive and Retransmission sequences while monitoring with wireshark

 

What would your approach be for this problem?

1 Reply

@YvonneP 

 

  • During the fallouts, we noticed long TCP Keep-Alive and Retransmission sequences while monitoring with wireshark

Is that for all sites/traffic or just SharePoint?  What does network utilisation look like at this time? 

 

Check the Microsoft 365 usage reports in the admin centre and check if anyone is syncing large amounts of data/number of files for SharePoint or OneDrive. Do you tell you users to sync document libraries as a default or only the files they need? 

 

I had a query the other day where the customer had 1 document library, 300GB and 320,000 files and all user were syncing the data and it was killing every laptop. They were doing this because they always used to have every file accessible in file explorer.