SlideShare a Scribd company logo
Online Conference
June 17th and 18th 2015
WWW.SPBIZCONF.COM
7 Steps to a Creating a Successful SharePoint
Recovery Plan
WWW.SPBIZCONF.COM
Paul LaPorte
Metalogix
Email : plaporte@Metalogix.com
LinkedIn :
linkedin.com/in/paullaporte
Expert in SaaS, hybrid and cloud business models,
business continuity, disaster recovery and security
Co-author: RBS for Dummies, 2013 Edition
Former global manager for SaaS solutions at Proofpoint
and senior executive of Evergreen Assurance, a pioneer
in real-time disaster recovery for mission critical
applications
WWW.SPBIZCONF.COM
Wasatch Mountain Ski Club, 1912
WWW.SPBIZCONF.COM
Wasatch Mountain Ski Club, 2011
Online Conference
June 17th and 18th 2015
Change is Inevitable
WWW.SPBIZCONF.COM
Ready for Recovery
• What is your backup strategy or approach?
• Do you have a recovery plan?
• Have you defined recovery objectives?
• Is your plan tested?
• Is your backup even tested?
WWW.SPBIZCONF.COM
Recovery – When, Not If
48% of organizations have had to execute their DR plans1
99.7% of SharePoint organizations have had to recover – some frequently2
• From lost or corrupted content, deleted sites, failed hard disks, etc…
• And the other 0.3% didn’t understand the question
Organizations experienced data loss or downtime for many reasons3
• 29% due to data corruption
• 26% due to accidental user error
• 23% due to security breech
3 in 5 companies don’t have a consistent way to
measure potential data loss
1. Symantec survey of 900 IT managers, http://www.computerweekly.com/news/2240083547/Disaster-recovery-plans-fail
2. 2014 Metalogix Business Continuity Survey750 SharePoint organizations surveyed
3. EMC 2014 global user survey, 3,300 businesses > 250 employees
WWW.SPBIZCONF.COM
Learning Objectives
Why a recovery plan is
critical to your job
How to make a
successful backup-and-
restore plan
What is your peer
group doing for SLAs
What do you do next
WWW.SPBIZCONF.COM
Case Study: Disaster at Sea
• Shipping company
• Several planned Dry Dock events
• Logistics Application in SharePoint tracks employees as the travel
and work at ports
– Supplier Companies and Vendors access to confirm travel and
dates.
– Access via Extranet
– Company uses this to track and report
• Project was 90% complete. The odd bug and some identified UI
issues remained.
WWW.SPBIZCONF.COM
Case Study: Disaster at Sea
• Line up for internal Governance Review to GO LIVE /
Production
• App was not hardened for Back Up or Recovery. SLA
was draft
• Non Project rogue employee convinced business
unit decided to do POP
• Dev environment received Production traffic
 Major SharePoint crash
• Loss of Time
• Loss of Data
• Confusion and blame game
WWW.SPBIZCONF.COM
Recover Point Objective (RPO)
• Defined: the maximum tolerable time period in which data
might be lost due to a server farm failure
• Example: 4 hour RPO means that SharePoint must be
backed at least every four hours
Mission Critical SharePoint Organizations Require
More Aggressive Recover Point Objectives (RPO)
WWW.SPBIZCONF.COM
SharePoint Backup Dilemma
Content Grows
WWW.SPBIZCONF.COM
SharePoint Backup Dilemma
Content Grows
Longer Backups
WWW.SPBIZCONF.COM
SharePoint Backup Dilemma
Content Grows
Longer Backups
More Risk of Data Loss
WWW.SPBIZCONF.COM
SharePoint Backup Dilemma
Content Grows
Longer Backups
More Risk of Data Loss
RPO
WWW.SPBIZCONF.COM
Backup Delays Cause Missed RPOs
Time to back up
9 hours
8 hours
7 hours
6 hours
5 hours
4 hours
3 hours
2 hours
1 hour
Recover
Point
Objective
Time to back up 1
TB Content
Database
WWW.SPBIZCONF.COM
Backup Delays Cause Missed RPOs
Time to back up
9 hours
8 hours
7 hours
6 hours
5 hours
4 hours
3 hours
2 hours
1 hour
Recover
Point
Objective
Time to back up 1
TB Content
Database
Takes up to 8 hours to
backup 1 TB database
If RPO is 4 hours, have
exceeded SLA by 4 hours!
WWW.SPBIZCONF.COM
Your Role
Disaster Recovery
vs.
Backup and Restore
WWW.SPBIZCONF.COM
Recovery Time Objective (RTO)
• Defined: the maximum time allowed for your
environment to be restored after an outage or data loss
• Example: 2 hour RTO means that data or farm must be
restore within 2 hours of the system outage.
Users/Organizations Demand Low RTO due to
Critical Nature of Content.
WWW.SPBIZCONF.COM
Different RTOs for SharePoint
Full SharePoint recovery
 Access to content vs. Access to SharePoint
Restores
 Farm
 Site
 Item
WWW.SPBIZCONF.COM
Where Do I Start?
WWW.SPBIZCONF.COM
Step 1: Start With Business Goals
• Consolidate and manage all collaboration content in a
single application platform
• Enable all users to work together online
• Reign in all unsanctioned content storage
• Minimize downtime and lost productivity
• Add key business partners into workflow
WWW.SPBIZCONF.COM
Step 2: Get an Executive Sponsor
Legitimize
Socialize
Support
Budget
Align
Protect
WWW.SPBIZCONF.COM
Step 3: Define Your SLA
What are your Recovery Objectives?
 How much downtime per event?
 How much downtime per month?
 How much content can be at risk?
 Which content needs most frequent backups?
 Which content needs to be recovered the quickest?
 What does SharePoint downtime cost the company?
 Do I need to be able to recover SharePoint without dependencies?
WWW.SPBIZCONF.COM
Step 4: Create a Recovery Plan
 Documented ownership
 Tasks
 Responsibilities
 Demarcation points
 Handoffs
A Living, Breathing Document
WWW.SPBIZCONF.COM
Step 5: Analysis Content
 Analyze current environment
 Risk profiles
 Impact of downtime
 Content categorization for risk and impact
 Backup requirements per category
Not All Content is Created Equal
WWW.SPBIZCONF.COM
Step 6: Validate
 Documented
 Signed-off solution
 Full tested
 Review cycle
You Want Me to Prove This?
WWW.SPBIZCONF.COM
Step 7: Testing
 Change is inevitable
 Change is often undetected
 Fire drills detect change
 Update your recovery plans and processes
Ongoing Testing is Mandatory
WWW.SPBIZCONF.COM
Ongoing Testing is Mandatory
• Only 1 in 4 organizations have run a SharePoint recovery test2
– Half who test don't document the test results
– Only one-third who test determine if they can achieve SLAs – (8% of all
organizations!)
• When companies test, 70% do not pass their own tests1
• Only 7.5% of companies tested and were successful in a
SharePoint recovery
• >2% of companies tested and required no change to the
plans following the test
• 93% of organizations had to change their disaster recovery
plans following tests.3
1. Disaster Recovery Preparedness Council, 2013
2. 2014 Metalogix Business Continuity Survey750 SharePoint organizations surveyed
3. http://www.computerweekly.com/news/1280090062/Companies-fail-to-test-disaster-recovery-plans
WWW.SPBIZCONF.COM
Executive
Sponsor
SLAs
Document
Risk
Analysis
Sign-Offs
Fire Drills
Update
Plan
Repeat For Success
WWW.SPBIZCONF.COM
BLOBs: The Root of SharePoint Backup Problems
BLOB = Binary Large Object
BLOB = binary representation of a file stored
in SQL Server (content database)
SharePoint content consists of structured
data (metadata) and unstructured data
(BLOBs)
BLOBs are immutable
BLOBs are created and deleted but never
updated
WWW.SPBIZCONF.COM
Externalizing BLOBs Makes Content Databases Really Small
Time to back up
9 hours
8 hours
7 hours
6 hours
5 hours
4 hours
3 hours
2 hours
1 hour
RecoverPoint
Objective
1 TB Content
Database shrinks to
50 GB
 Externalizing BLOBs makes achieving
Recover Point Objective easy.
 1 TB content database becomes 50GB
 BLOBs continuously backed up by
StoragePoint
 Content DB Automatically backed up by
StoragePoint
WWW.SPBIZCONF.COM
StoragePoint Shatters Limitations
 Quick install and easy management within
SharePoint
 Wide range of connectors for on premise and
Cloud storage providers
 Integrated with SharePoint Backup for
lightening-fast backups.
#1 SharePoint Storage Externalization
• Up to 98% reduction in SQL size
• Expand storage and slash storage costs
• Over 100% faster uploads and downloads
WWW.SPBIZCONF.COM
For More Information: 7 Steps eBook
http://www.metalogix.com/Promotions/SharePoint-Backup/WhitePapers-and-
ebooks/7-steps-to-creating-a-successful-recovery-plan.
WWW.SPBIZCONF.COM
What Do I Do Next?
www.Metalogix.com/RecoveryChecklist
WWW.SPBIZCONF.COM
Super, Extra Bonus Material
How Disaster Recovery and Continuity Planning
are Changing with the Cloud
WWW.SPBIZCONF.COM
Key Metrics
• Backup & Recovery
– RPO
– RTO
• Cloud Service Provider
– SLA
– Uptime
– MTBF – Mean Time Between Failures
– MTTR – Mean Time To Recover
• Incident Results
– Keep track of these too!
WWW.SPBIZCONF.COM
SharePoint Online Recovery Options
SharePoint Online -- 4 Different ways you recover data
• Using the Recycle bins and version control
• Third party tools that can back up your Data
• Service request through portal
• Make a manual backup of your Sites or Document library
WWW.SPBIZCONF.COM
User Recycle bin (First stage Recycle bin)
• 90 Days. Previously was 30 days
Site Collection Recycle Bin (Second stage Recycle bin)
• Max time in 1st and 2nd stage recycle bins is 90 days
• Also contains deleted Sites
Version Control
• Default option is “Off” - good idea to enable it
• Versioning takes additional room – but is the best way to get item level recovery. Take the
storage hit
• No limit on versions. Deleting (or restoring) one version deletes all associated versions
Using the Recycle Bins & Version Control
WWW.SPBIZCONF.COM
• If you really mess something up in a Site collection
• Microsoft makes backup every 12 hours and
keep it for 14 days
• Specify a time range of at least 12 hours
• May take 2 or more days for the restore to be performed
• Only restore option is a Full site collection restore to the same URL
• No restore to a different URL or download backups offline
• Important -- changes made after the backup time will be lost!
• May be possible to use PowerShell, Set-SPOSite cmdlet to lock site, but will result in no access
for users
Service Request Through Portal
WWW.SPBIZCONF.COM
End-to-End SLA
Know all the players and respective SLAs
• Cloud
• SaaS
• SharePoint Admin
• Backup/Infrastructure Team
Administrators who provide service based on a public cloud
infrastructure should define SLAs end-to-end
• Winning a disputed SLA claim does not save your job
• End users just want to ensure service availability, “It’s there when I need it.”
WWW.SPBIZCONF.COM
Filling In Native Cloud Backup Gaps
Need options to send backup files to another cloud location, such as Amazon Web Services (AWS), Azure,
another cloud or customer's on-premises location. Considerations
Backup
• Initial backup and restore options to ship physical media overnight to/from the cloud backup facility.
• Frequency: based on customer needs, not predefined by MSFT (every 12 hours)
• Location: send the backup data to another public cloud provider
• Retention: Allow customers to set up their retention based on their business needs instead of a fixed number of days (such as 14 days or
30 days)
• High fidelity backup: more complete protection or advanced backup policy
• Limitations due to distance from the cloud data center, network bandwidth and throughput
• Source-side deduplication capabilities to help with backup size
Recovery
• Self-service restore/recovery: recover faster without delays waiting for SaaS support's response
• More restore options: in-place, out-of-place (no overwrites), restore to a different location and restore of selected versions (instead of all
versions)
• How long will a restore from the cloud take? Will it meet RTOs?
• Restore time determines if a cloud backup strategy needs to be augmented by local disk backup for faster recovery
WWW.SPBIZCONF.COM
Please fill in my session feedback form available
from the ‘Session Resources’ tab
on my session window.
Why not join us in October at
7 Steps to a Creating a
Successful SharePoint Recovery Plan

More Related Content

Speakers slidedeck sp-biz laporte

  • 1. Online Conference June 17th and 18th 2015 WWW.SPBIZCONF.COM 7 Steps to a Creating a Successful SharePoint Recovery Plan
  • 2. WWW.SPBIZCONF.COM Paul LaPorte Metalogix Email : plaporte@Metalogix.com LinkedIn : linkedin.com/in/paullaporte Expert in SaaS, hybrid and cloud business models, business continuity, disaster recovery and security Co-author: RBS for Dummies, 2013 Edition Former global manager for SaaS solutions at Proofpoint and senior executive of Evergreen Assurance, a pioneer in real-time disaster recovery for mission critical applications
  • 5. Online Conference June 17th and 18th 2015 Change is Inevitable
  • 6. WWW.SPBIZCONF.COM Ready for Recovery • What is your backup strategy or approach? • Do you have a recovery plan? • Have you defined recovery objectives? • Is your plan tested? • Is your backup even tested?
  • 7. WWW.SPBIZCONF.COM Recovery – When, Not If 48% of organizations have had to execute their DR plans1 99.7% of SharePoint organizations have had to recover – some frequently2 • From lost or corrupted content, deleted sites, failed hard disks, etc… • And the other 0.3% didn’t understand the question Organizations experienced data loss or downtime for many reasons3 • 29% due to data corruption • 26% due to accidental user error • 23% due to security breech 3 in 5 companies don’t have a consistent way to measure potential data loss 1. Symantec survey of 900 IT managers, http://www.computerweekly.com/news/2240083547/Disaster-recovery-plans-fail 2. 2014 Metalogix Business Continuity Survey750 SharePoint organizations surveyed 3. EMC 2014 global user survey, 3,300 businesses > 250 employees
  • 8. WWW.SPBIZCONF.COM Learning Objectives Why a recovery plan is critical to your job How to make a successful backup-and- restore plan What is your peer group doing for SLAs What do you do next
  • 9. WWW.SPBIZCONF.COM Case Study: Disaster at Sea • Shipping company • Several planned Dry Dock events • Logistics Application in SharePoint tracks employees as the travel and work at ports – Supplier Companies and Vendors access to confirm travel and dates. – Access via Extranet – Company uses this to track and report • Project was 90% complete. The odd bug and some identified UI issues remained.
  • 10. WWW.SPBIZCONF.COM Case Study: Disaster at Sea • Line up for internal Governance Review to GO LIVE / Production • App was not hardened for Back Up or Recovery. SLA was draft • Non Project rogue employee convinced business unit decided to do POP • Dev environment received Production traffic  Major SharePoint crash • Loss of Time • Loss of Data • Confusion and blame game
  • 11. WWW.SPBIZCONF.COM Recover Point Objective (RPO) • Defined: the maximum tolerable time period in which data might be lost due to a server farm failure • Example: 4 hour RPO means that SharePoint must be backed at least every four hours Mission Critical SharePoint Organizations Require More Aggressive Recover Point Objectives (RPO)
  • 14. WWW.SPBIZCONF.COM SharePoint Backup Dilemma Content Grows Longer Backups More Risk of Data Loss
  • 15. WWW.SPBIZCONF.COM SharePoint Backup Dilemma Content Grows Longer Backups More Risk of Data Loss RPO
  • 16. WWW.SPBIZCONF.COM Backup Delays Cause Missed RPOs Time to back up 9 hours 8 hours 7 hours 6 hours 5 hours 4 hours 3 hours 2 hours 1 hour Recover Point Objective Time to back up 1 TB Content Database
  • 17. WWW.SPBIZCONF.COM Backup Delays Cause Missed RPOs Time to back up 9 hours 8 hours 7 hours 6 hours 5 hours 4 hours 3 hours 2 hours 1 hour Recover Point Objective Time to back up 1 TB Content Database Takes up to 8 hours to backup 1 TB database If RPO is 4 hours, have exceeded SLA by 4 hours!
  • 19. WWW.SPBIZCONF.COM Recovery Time Objective (RTO) • Defined: the maximum time allowed for your environment to be restored after an outage or data loss • Example: 2 hour RTO means that data or farm must be restore within 2 hours of the system outage. Users/Organizations Demand Low RTO due to Critical Nature of Content.
  • 20. WWW.SPBIZCONF.COM Different RTOs for SharePoint Full SharePoint recovery  Access to content vs. Access to SharePoint Restores  Farm  Site  Item
  • 22. WWW.SPBIZCONF.COM Step 1: Start With Business Goals • Consolidate and manage all collaboration content in a single application platform • Enable all users to work together online • Reign in all unsanctioned content storage • Minimize downtime and lost productivity • Add key business partners into workflow
  • 23. WWW.SPBIZCONF.COM Step 2: Get an Executive Sponsor Legitimize Socialize Support Budget Align Protect
  • 24. WWW.SPBIZCONF.COM Step 3: Define Your SLA What are your Recovery Objectives?  How much downtime per event?  How much downtime per month?  How much content can be at risk?  Which content needs most frequent backups?  Which content needs to be recovered the quickest?  What does SharePoint downtime cost the company?  Do I need to be able to recover SharePoint without dependencies?
  • 25. WWW.SPBIZCONF.COM Step 4: Create a Recovery Plan  Documented ownership  Tasks  Responsibilities  Demarcation points  Handoffs A Living, Breathing Document
  • 26. WWW.SPBIZCONF.COM Step 5: Analysis Content  Analyze current environment  Risk profiles  Impact of downtime  Content categorization for risk and impact  Backup requirements per category Not All Content is Created Equal
  • 27. WWW.SPBIZCONF.COM Step 6: Validate  Documented  Signed-off solution  Full tested  Review cycle You Want Me to Prove This?
  • 28. WWW.SPBIZCONF.COM Step 7: Testing  Change is inevitable  Change is often undetected  Fire drills detect change  Update your recovery plans and processes Ongoing Testing is Mandatory
  • 29. WWW.SPBIZCONF.COM Ongoing Testing is Mandatory • Only 1 in 4 organizations have run a SharePoint recovery test2 – Half who test don't document the test results – Only one-third who test determine if they can achieve SLAs – (8% of all organizations!) • When companies test, 70% do not pass their own tests1 • Only 7.5% of companies tested and were successful in a SharePoint recovery • >2% of companies tested and required no change to the plans following the test • 93% of organizations had to change their disaster recovery plans following tests.3 1. Disaster Recovery Preparedness Council, 2013 2. 2014 Metalogix Business Continuity Survey750 SharePoint organizations surveyed 3. http://www.computerweekly.com/news/1280090062/Companies-fail-to-test-disaster-recovery-plans
  • 31. WWW.SPBIZCONF.COM BLOBs: The Root of SharePoint Backup Problems BLOB = Binary Large Object BLOB = binary representation of a file stored in SQL Server (content database) SharePoint content consists of structured data (metadata) and unstructured data (BLOBs) BLOBs are immutable BLOBs are created and deleted but never updated
  • 32. WWW.SPBIZCONF.COM Externalizing BLOBs Makes Content Databases Really Small Time to back up 9 hours 8 hours 7 hours 6 hours 5 hours 4 hours 3 hours 2 hours 1 hour RecoverPoint Objective 1 TB Content Database shrinks to 50 GB  Externalizing BLOBs makes achieving Recover Point Objective easy.  1 TB content database becomes 50GB  BLOBs continuously backed up by StoragePoint  Content DB Automatically backed up by StoragePoint
  • 33. WWW.SPBIZCONF.COM StoragePoint Shatters Limitations  Quick install and easy management within SharePoint  Wide range of connectors for on premise and Cloud storage providers  Integrated with SharePoint Backup for lightening-fast backups. #1 SharePoint Storage Externalization • Up to 98% reduction in SQL size • Expand storage and slash storage costs • Over 100% faster uploads and downloads
  • 34. WWW.SPBIZCONF.COM For More Information: 7 Steps eBook http://www.metalogix.com/Promotions/SharePoint-Backup/WhitePapers-and- ebooks/7-steps-to-creating-a-successful-recovery-plan.
  • 35. WWW.SPBIZCONF.COM What Do I Do Next? www.Metalogix.com/RecoveryChecklist
  • 36. WWW.SPBIZCONF.COM Super, Extra Bonus Material How Disaster Recovery and Continuity Planning are Changing with the Cloud
  • 37. WWW.SPBIZCONF.COM Key Metrics • Backup & Recovery – RPO – RTO • Cloud Service Provider – SLA – Uptime – MTBF – Mean Time Between Failures – MTTR – Mean Time To Recover • Incident Results – Keep track of these too!
  • 38. WWW.SPBIZCONF.COM SharePoint Online Recovery Options SharePoint Online -- 4 Different ways you recover data • Using the Recycle bins and version control • Third party tools that can back up your Data • Service request through portal • Make a manual backup of your Sites or Document library
  • 39. WWW.SPBIZCONF.COM User Recycle bin (First stage Recycle bin) • 90 Days. Previously was 30 days Site Collection Recycle Bin (Second stage Recycle bin) • Max time in 1st and 2nd stage recycle bins is 90 days • Also contains deleted Sites Version Control • Default option is “Off” - good idea to enable it • Versioning takes additional room – but is the best way to get item level recovery. Take the storage hit • No limit on versions. Deleting (or restoring) one version deletes all associated versions Using the Recycle Bins & Version Control
  • 40. WWW.SPBIZCONF.COM • If you really mess something up in a Site collection • Microsoft makes backup every 12 hours and keep it for 14 days • Specify a time range of at least 12 hours • May take 2 or more days for the restore to be performed • Only restore option is a Full site collection restore to the same URL • No restore to a different URL or download backups offline • Important -- changes made after the backup time will be lost! • May be possible to use PowerShell, Set-SPOSite cmdlet to lock site, but will result in no access for users Service Request Through Portal
  • 41. WWW.SPBIZCONF.COM End-to-End SLA Know all the players and respective SLAs • Cloud • SaaS • SharePoint Admin • Backup/Infrastructure Team Administrators who provide service based on a public cloud infrastructure should define SLAs end-to-end • Winning a disputed SLA claim does not save your job • End users just want to ensure service availability, “It’s there when I need it.”
  • 42. WWW.SPBIZCONF.COM Filling In Native Cloud Backup Gaps Need options to send backup files to another cloud location, such as Amazon Web Services (AWS), Azure, another cloud or customer's on-premises location. Considerations Backup • Initial backup and restore options to ship physical media overnight to/from the cloud backup facility. • Frequency: based on customer needs, not predefined by MSFT (every 12 hours) • Location: send the backup data to another public cloud provider • Retention: Allow customers to set up their retention based on their business needs instead of a fixed number of days (such as 14 days or 30 days) • High fidelity backup: more complete protection or advanced backup policy • Limitations due to distance from the cloud data center, network bandwidth and throughput • Source-side deduplication capabilities to help with backup size Recovery • Self-service restore/recovery: recover faster without delays waiting for SaaS support's response • More restore options: in-place, out-of-place (no overwrites), restore to a different location and restore of selected versions (instead of all versions) • How long will a restore from the cloud take? Will it meet RTOs? • Restore time determines if a cloud backup strategy needs to be augmented by local disk backup for faster recovery
  • 43. WWW.SPBIZCONF.COM Please fill in my session feedback form available from the ‘Session Resources’ tab on my session window. Why not join us in October at 7 Steps to a Creating a Successful SharePoint Recovery Plan

Editor's Notes

  1. Click… That’s really going to be the theme of todays webinar- to get involved in the strategy. Backup and recovery can be a dry subject. Let’s be honest, it’s not the sexiest thing talk about. It’s pretty doom and gloom. It’s like buying life insurance. But if you’re prepared, not only technically with what steps you’re going to take, but have also prepared the business, then there’s nothing to worry about. You can sleep well knowing your baby is safe. Click… So today, we’ll either validate the steps you’ve already taken or we’ll fill in the gaps that have been left out of SharePoint’s recovery objectives. We’ll discuss: How you’re involved in the process as a SharePoint admin, architect, DBA, or whatever your role may be. A lot of us may have just had SharePoint dumped on us and we’re treating like any other application we have. If you haven’t realized it yet, SharePoint is a massive platform with a lot of moving parts. It does not fit into a one size fits all recovery plan. We’ll go into about 8 or so pieces that will make you successful. The theme here is going to be focused on creating the right backup strategy for the business. What you’ll notice is that from a technology point of view, backup is pretty simple and you have a lot of options. I always say that backup is a science while recovery is an art. I tell clients and prospects this all the time because anyone can backup SharePoint with a few clicks or scripts. But a surprising few think about preparing for the recovery- like can I restore everything I need? How long will it take to recover from a complete hardware malfunction versus a more common scenario like how long the loss of content will take to restore? When we go through creating a plan here in a bit, the backup piece involves working with the business to come up with a solution while the recovery piece is heavily driven by the technology you choose. Click… From there, we’ll go into what options you have (technology-wise) and what will be a good fit for you. Everything from what’s free OOTB to product add ons. Click… Then, once you’ve weighed your options and understand how many man hours you are willing to put in and how much you’re willing to spend, we’ll discuss who to involve so that your backup and recovery plan becomes a sharply honed process.
  2. The goal is to have the smallest RPO possible
  3. Let’s look at an example and discuss some of Microsoft best practices. First of, MSFT recommends not using OOTB backup tools for content DBs > 200 GB due to risk of missing backup windows and Recover Point Objectives. Microsoft tested native SharePoint and SQL backup and was able to back up only 600 gigabytes in six hours using a high-end server. In my opinion, a 600 GB database is smaller than a typical SharePoint farm these days. MS even states that if you’re using OOTB techniques, limit your content database size to 100 GB and site collection backup should not be used on anything larger than 85 GB. That means you can’t granularly restore content reliably if a site collection is too large. It’s bc the process is too resource intensive and will take too long.
  4. You need to understand your role. Are you responsible if the power goes out and you can’t even communicate with your servers? That’s probably not your responsibility and it will be handled by the infrastructure team. But that team is taking backups and you need to know what state they can get you back to after a disaster. Ask them what tools they’re using and work with them to find the limitations they have with SharePoint Maybe they don’t support only restoring individual content dbs, or individual objects like sites, lists and items in full fidelity- with workflow, versions, and such.
  5. This is a true test of a good recovery strategy. End users can be demanding, and often don’t care about the limitations of SharePoint and its toolset. There are always particular industries – I’m looking at the world of finance and law and such verticals – who’s users are not very understanding and demand a very low RTO.
  6. So where do you start? There’s a lot that must be prepared so we’ll run through the major planning aspects from who to involve to analyzing what you can do to lessen your risk
  7. Get yourself an Executive sponsor Find a tech savvy executive to roll out your plan as part of overall SharePoint governance As I said in the opening objectives, the backup piece of your strategy involves more human interaction than working with the technology. You just need to decide what is possible- here are the RPOs and RTOs we can technically meet, and the business must decide if that aligns with the knowledge workers. This way, you’re not making decisions in a vacuum.
  8. Create the dreaded Service Level Agreement (SLA) and get it signed off by executive sponsors and other stakeholders Define acceptable levels of what can be recovered and how long that recovery process will take. Define certain locations of SharePoint that you know contain documents that are heavily edited and constantly changing and may need to be backed up more often than stale locations. Keep in mind to separate disaster from day to day recovery…The majority of recovery work you’ll do is simple document, list or site recovery. These are simple to backup and restore on the surface but they will probably be different from environment to environment. Maybe there’s a customization you didn’t even know about because it was implemented before you got there. Are there certain dependencies on this content that need to be restored as well or are is the business ok with simply getting the document back online? All of this needs to be considered and documented in the SLA
  9. Document the ownership of tasks, responsibilities, demarcation points, and handoffs This will involve different parts of the business. We need to ask questions and assign responsibilities to the different part of the business who have a stake in SharePoint. Have quarterly meetings with key stakeholders in each department. Go over their section of the farm to know what is important to them. Give ownership so the SLA can be updated and signed off by these folks. These people can also be your QA after a restore is finished. Taking these steps will involve making the SLA a public document. The onus does not fall squarely on you once this document is public. You’ve provided steps to take along the way from a backup plan to a restore methodology but it’s a team effort that all are aware of.
  10. Next, Conduct an impact and risk analysis of current environment For example, I had a client who had a site collection that housed financial reports and data for the executive team. Content was added to it every day between 8am and noon. The admin knew he needed the shortest RPO possible on this section of the farm. After discussing the actual business case with the executives, he realized that the content was only being added to the site for read only publication and was not being edited AND it wasn’t the only system of record of the document. It wasn’t SharePoint’s responsibility to immediately back up these documents and once a day would suffice for the business. He saved himself a lot of time and money by having a conversation. On the other hand, you may have some databases or sites that need to backed up more often then the rest of your farm. Ensure that these sections of the farm get special treatment and have shorter RPOs.
  11. Yes, you must prove it! All this planning and research you’ve done needs to be tested and constantly reviewed…. Proving out your SLA goes a long way so you wont have to act like this guy in the cartoon and scream for help.
  12. Did I mention you need to prove it?! I can’t stress this point enough- mostly because it’s rarely accomplished: Conduct ongoing fire drills! Continually review the outcomes. SharePoint has probably changed since you last did a test and you don’t want to be caught with your pants down. Once you conduct fire drills, and based on their outcomes, update the SLA document with the changes to the farm and changes to RPOs and RTOs…..Then you can start the process all over again… It will be worth it in the end. You’re not ignoring the fact that a loss of data will happen. It’s when, not if. And if you’ve put these processes into place, you’ll be prepared. My clients always find something to change when conducting fire drills. I once worked with someone who noticed that their backup sets were corrupted but there was no clue in the actual backup file itself. It was only surfaced once the restore failed. This ended up being exponentially detrimental because he was using incremental backups. To remind you, incremental backups go back to the last backup taken no matter what type, and only backup what has changed. This means that if one backup is corrupted, the rest of the future backups are useless since they’re all dependent on each other. A best practice I usually recommend is to conduct these fire drills on a secondary recovery environment. In a true disaster, you can even restore the databases to this recovery environment and redirect users there. Don’t forget that a large portion of a successful strategy involves communication and ensuring the expectations of the business is set. The point of preparing these documents and doing all this planning is that the expectations of the users don’t outstep reality. Your job is on the line and you need a way to ensure end users to not get angry because of their outlandish expectations. Funny story, I had a client who would send an email that said “Site back, relax and grab a cup of coffee. Your content will be with you shortly” when he received a ticket to restore lost data. Attached to the email was the SLA. Setting expectations to the business is key to your strategy. It’s not all about the technology.
  13. A review of what we’ve been discussing but use this as your checklist… Attain an Executive sponsor Service Level Agreement (SLA) signed off by executive sponsor and stakeholders Documented ownership of tasks, responsibilities, demarcation points, and handoffs Conduct risk analysis of current environment Fully tested, documented and sign off Ongoing fire drills and updates to SLA
  14. So those recommendations are well and good but it doesn’t address the root of the problem. BLOBs or Binary Large Objects make up 90-95% of your databases. This is the content. It’s considered the unstructured data while the metadata of the document is the structured portion. Basically, BLOBs are what is growing your environment so rapidly and extending your RPOs. The interesting thing about them is that blobs are immutable. They are never updated, only created and delete. If they never change, why do we have to back them up every time? You don’t. Think about it- every time you back your farm up for granularly recovery, you’re backing up data you know has not changed. In reality, you only need to backup the blobs that have been added to SharePoint because new documents were added or existing documents were edited. Microsoft provides a couple of different libraries to externalize these BLOBs outside of the content database. Docs can be put on devices that have faster read/writes, i/o, and are much cheaper. This is a win for performance and cost and if done correctly, can drastically shorten your RPOs.
  15. Metalogix backup integrates with our or RBS product called StoragePoint. With StoragePoint, blobs are backed up continuously as they are added to SharePoint. They are not included as part of the backup because they have not changed. Thus, once a restore is initiated, a call will be made to grab the BLOB based on the pointer to it in the database. Your backup time is drastically reduced because you only have to backup your databases now. And guess what, once BLOBs are externalized, your database shrinks by 95%. Thus your backup window shrinks drastically. So back to our example a couple of slides ago, your 1 TB content database has become 50GB. BLOBs are immediately backed up and ready at any point to be restored. Clearly this changes your backup strategy so any RBS product you look at should do a couple of things: 1) Retain blobs even if they’re deleted in SharePoint for a specified amount of time. This means that you can continue to use out of the box methods for restores. Let’s say you define a blob retention period of 30 days. This means you can restore your SQL or SharePoint backups from anytime within the last 30 days and those backups will have references to blobs that have been retained. Or 2) Ensure your RBS product integrates with your 3rd party backup product. Obviously this provides more automation and can really be the subject of a whole other webinar.