Strategic Consultant

0c5fe232-c744-40fa-be04-ed9ccf495b26-original.jpeg

Advertisements

ICT Architecture Titles by Deliverables

ICT Architecture Titles by Deliverables

 

As a Solutions Architect, I always like to think outcome or deliverables based. So here is a short description of ICT Architecture Titles by Deliverables. There seems to be a lot of confusion around these titles so hope this helps.

These titles vary depending where the nominated titles is allocated inside ICT Supply Chain. (Customer, MSP, Vendor, Solutions Integrator or Distributor) more on this here:- http://www.insidespin.com/channeltopics.php#resellers .

(Loosely based on TOGAF and Zachman)

ICT Architecture Titles and Deliverables

  1. Presales / Sales Engineers (Vendors and SI side only)
    • Competitive Selling
    • Proof of Concepts/Pilots/Health Checks
    • Product Selection / Commercial and Licensing Models
    • Bill of Materials
    • Account Planing
    • Customer Training
  2. Enterprise Architects (Usually only customer side) Quasi CIO level
    • Business Case
    • Strategy/Financial Justifications (ROI, TCO, Net Present Value, RPO/RTOs)
    • Project Managment
    • RFP / RFI Scoping
    • Risk and Governances
    • Technology Roadmaps
    • Reference Architectures
    • Vendor Selection
    • Architecture Practices
    • Sourcing Agreements
    • Service Level Agreements
  3. Solutions Architects/Domain Architects (Infrastructure/Application/Security)(Customer, Vendor or SI)
    • Infrastructure Requirements
    • Responding to RFI/RFPs
    • Statement of Work
    • High Level Design
    • Bill of Materials
    • Services Estimates
  4. Subject Matter Experts (Delivery Focused)
    • Low-Level Design
    • As-Built Documentations
    • Operating Manuals

This is Rule of Thumb only.

Reference

Spaghetti Situations

Spaghetti Situations

 

  • In ‘spaghetti situations’ in which everything is connected to everything, and everything influences everything it is by far not obvious what the best solution is. All people involved have a different idea about what the problem is. And if you ask them, all these people have different ideas about what the solution could;. If you, as engineer, consultant, manager or analyst are in a situation like this, then what to do?
  • In order to solve complex problems its required to define a overall strategy and method that will guide the development of the complex solutions and following a set critical path, budget and timeframe.
  • In order to design and accelerate the implementation of a solution, customers must commit to design decision and any acceptable risks. Multiply components solutions adds to the complexity, through validation and support from vendors is a absolute requirement and they must be invested in the success of the solution
  1. Define Budget
  2. Define Business Requirements
  3. Define Technical Requirements
  4. Define Success Criteria and key metrics
  5. Identify Risks
  6. Solutions Definition
  7. Project initiation
  8. Migration Plan
  9. Operate

Chief Technology Officer (CTO) Job Description

Chief Technology Officer (CTO) Job Description

  • Provide organizational direction with regards to all internal technologies  (production, decision support, productivity)
  • Provide organizational direction with regards to all customer-facing and prospect  technologies (including mobility, data and qualitative products, and user trial)
  • Create technology strategic plans and budgets with emphasis on cost/benefits analysis
  • Manage both in-house, and third party service providers and consultants.
  • Set the overall direction for information technology through strategic planning and evaluation by establishing an IT strategic road map that supports and advances corporate business objectives in concert with Senior Management team
  • Ensure that the IT Department delivers consistent, repeatable and cost effective IT infrastructure, data and voice communications infrastructure, applications support, project management and application development services to both internal and external customers
  • Manage all aspects of IT Department staffing, including employee development and annual salary planning
  • Communicate IT plans and status throughout all levels of the companyWork closely with Sales leadership to develop focused online marketing and sales strategy
  • Provide hands-on experience throughout all phases of the systems development lifecycle
  • Support and foster the existing high energy and morale of the group and foster the continuous improvement culture of the organization and get the most out of the existing teams
  • Co-ordinate and drive the business relationships with key internal and significant external partners
  • Develop and maintain an appropriate IT organizational structure that supports the needs of the business
  • Spearhead the company-wide technical vision.
  • Outline, plan, and execute the technical strategy, architecture, and development goals to meet the business needs of Huawei big data solution
  • Lead and manage the technology planning, the evaluation/decision of alternative technologies, the production of technical documentation, prototype development
  • Manage the technical aspects of platform and products and their compatibility with the ecosystem/ customers’ systems
  • Inspire and work closely with global engineering team to ensure the delivery of the platform and products and oversee the full lifecycle.
  • Work closely with product team to understand and anticipate business requirements

Exploring BCP/DRP and Disaster Avoidance / Continuous Availability (Active/Active) Design

Exploring BCP/DRP and Disaster Avoidance / Continuous Availability (Active/Active) Design

Diaster

If a disaster or IT outage occurred to your business, RIGHT NOW. How confident are you with your DRP? In today’s network economy, down-time is not an option.

DRP and BCP strategy is governed by the CEO and CIO and they are responsible for the solution to define the business impact ,  requirements and investment. IT has to provide guidance to develop a BCP and DR Strategy and get buy-in from the business.

BCP vs DR

  1. BCP – Planning to continue your business operations in case of a disaster.
  2. DRP – Planning to recover from disaster situations – How the IT (information technology) should recover in case of a disaster.

Other Important Definitions

  1. Continuous Data Protection – Replication solutions can be either synchronous or asynchronous, meaning transfer of data to a remote copy is achieved either immediately or with a short time delay. Both methods create a secondary copy of data identical to the primary copy, with synchronous solutions achieving this in real time. This means that any data corruption or user file deletion is immediately (or very quickly) replicated to the secondary copy, therefore making it ineffective as a backup method.
    1. Copy-on-write snapshot – Most snapshot implementations use a technique called copy-on-write, which makes an initial snapshot then further updates as data is changed. Restoration to a specific point in time is possible as long as all iterations of the data have been kept. For that reason, snapshots can protect against data corruption, unlike replication.
    2. Clone/split-mirror snapshot – Another common snapshot variant is the split-mirror, where reference pointers are made to the entire contents of a mirrored set of drives, file system or LUN every time a snapshot is made. Clones take longer to create than copy-on-write snapshots because all data is physically copied when the clone is created. There is also the risk of some impact to production performance when the clone is created because the copy process has to access primary data at the same time as the host.
    3. Continuous data protection (CDP) – CDP is a method of snapshotting that tracks and store all updates to data as they occur. Theoretically, this means CDP solutions can roll back to any point in time, down to the smallest granularity of update. But there is a price to pay with CDP in terms of the cost of storage needed to keep every changed block copy and the performance impact of storing the data. As a result, some vendors implement what they call near-CDP, taking snapshots of changed data at set times and consolidating changes over a longer time period. This means heavily updated data doesn’t overwhelm the capacity of the CDP system. In virtual environments, APIs such as vSphere’s VADP enable CDP solutions to be implemented by third-party software vendors.
  2. Clustering and Availability 
    1. Fault Tolerant 
    2. Highly Available
    3. Metro/GeoClusters
    4. Culstering for Performance / Load Balancing (Scale-out)
  3. Backup – Backup is the process of making a secondary copy of data that can be restored to use if the primary copy becomes lost or unusable. Backups usually comprise a point-in-time copy of primary data taken on a repeated cycle – daily, monthly or weekly.
  4. Archival – Is storing copies of data all version for Long Retention periods, 7 years or more and in Legal Hold requirements for life time of that organisation. 

questions to ask YOURSELF?

Business Disaster Recovery Questionnaire-2014_201412231116104837

  1. Does your business/organization have a business continuity or disaster recovery plan? Not start, In Progress, Complete
  2. What are your most important business functions and how quickly can you resume following a disaster?
  3. How often do you review and test your disaster recovery plan?
  4. Have you established an alternative location where employees can work on key function off site?
  5. Do you have access to multiple, reliable methods of communicating with your employees (emergency phone numbers, pagers, radios or website)?

Disaster AVOIDANCE continuous availability – NO DR RUN BOOKS

  • “40% of all companies that experience a major disaster will go out of business if they cannot gain access to their data within 24 hours.” -Gartner
  • “43% of companies experiencing disasters never re-open and 29% close within two years.” -McGladrey and Pullen
  • “93% of businesses that lost their datacenter for 10 days went bankrupt within one year.” -National Archives & Records Administration

More information on some stats on DR Failures – http://www.continuitycentral.com/feature0660.html

Almost all business that were inside the Twin Towers that didn’t have proper DR technology solutions went bankrupt, a number of companies that develop continuous availability survived and prospered.

BUSINESS CONTINUITY Regulatory Requirements

  • ISO 22301 Business Continuity – http://www.iso.org/iso/catalogue_detail?csnumber=50038
  • ISO – ISO 22301:2012, “Societal security — Business continuity management systems — Requirements”, specifies a management system to manage an organization’s business continuity arrangements. It is formal in style in order to facilitate compliance auditing and certification.
  • It is supported by ISO 22313:2012, “Societal security — Business continuity management systems – Guidance” which provides more pragmatic advice concerning business continuity management.
  • ISO/IEC 27031:2011, “Information security – Security techniques — Guidelines for information and communication technology [ICT] readiness for business continuity” offers guidance on the ICT aspects of business continuity management.
  • United Kingdom – British Standard BS 25999 was a two-part business continuity management standard. “BS 25999-1:2006 Business Continuity Management. Code of Practice” offered pragmatic implementation guidance, but was withdrawn in 2012 when ISO 22313 effectively superseded it. “BS 25999-2:2007 Specification for Business Continuity Management” formally specified a set of requirements for a business continuity management system. It too was withdrawn in 2012 when it was (in effect) replaced by ISO 22301.North America – Published by the National Fire Protection Association NFPA 1600: Standard on Disaster/Emergency Management and Business Continuity Programs.North America – ASIS/BSI BCM.01:2010 published Dec 2010ANSI/ASIS SPC.1-2009 Organizational Resilience:
  • The ANSI/ASIS SPC.1-2009 Organizational Resilience: Security, Preparedness, and Continuity Management Systems—Requirements with Guidance for Use American National Standard is under consideration for inclusion in the DHS PS-Prep, a voluntary program designed to enhance national resilience in an all hazards environment by improving private sector preparedness.Australia – Published by Standards Australia HB 292-2006 : A practitioners guide to business continuity management HB 293-2006 : Executive guide to business continuity management In 2010, Standards Australia introduced their Standard AS/NZS 5050 that connects far more closely with traditional risk management practices. This interpretation is designed to be used in conjunction with AS/NZS 31000 covering risk management.
  • APRA

DR RUN BOOKS ARE THE WRONG WAY 

The complexity of maintaining DR Run Books and complex DR Technology and expensive solutions means that most of the time it is a wasted invested that fails. It is better to develop Continuous availability technology solution.

x – represent human capital and errors!

  1. Business impact analysis xxxx
  2. Business case xxxx
  3. BCP
  4. DRP
  5. DR Reference Architecture
  6. DR High Level Design
  7. DR Detailed Design
  8. DR Design for Applications
  9. Implementation and Configuration
  10. DR RUN Book development and maintenance
  11. Change Management and DR RUN book Mainteance
  12. Yearly DR Test xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

Usually DR RUN books are updated at the time of the DR Test, not during BAU. Changes to the environment will also affect the DR RUN book and Change Management usually neglects to u[date the DR RUN books due to the Human Factor.

Continuous Availability can eliminate the most time consuming error prones areas and maintain DR posture by optimization or even eliminate Stage  11 and 12.
Continuous availability is simply achieved by virtualising the Network layer and Storage Layers. This reduces the overall complexity of DR RUN books and Operation Costs to test and maintain them in a separated environment and makes use of cold or passive datacenters. Initial investment of setting up continuos availability is achieved quickly when you can maintain 100% DR posture compared that to the cost of a maintain DR, isolated testing, exposure and failed compliance  How much does it cost for you to maintain DR RUN Books and test, invest all of these soft opex costs can be use building a Continuos availability solution. Continuous Availability will lower the costs and complexity.

Define RPO/RTO

It order to develop a proper DR solution for any customer, it is imperative to document RPO and RTO and classify and Tier Application. This needs to be documented and signed off in a Business case that should define a BCP plan. Read more info here –https://virtualizationandstorage.wordpress.com/2014/06/23/defining-dr-ha-rto-and-rpo-bcp/

rpo

Event_Timeline

In this business case you will need to allocate a investment to this DR design:

Identify Application Tiers and Uptime

You will also need to identify and classify key Applications and place them into Tiers of importance. A good reference for this classification is to use the uptime institute as a guide. This should also contain uptime requirements for each application Tier.

  • Tier 4 = Tier 1 + Tier 2 + Tier 3 + all components are fully fault-tolerant including uplinks, storage, chillers, HVAC systems, servers etc. Everything is dual-powered.
  • Tier 3 = Tier 1 + Tier 2 + Dual-powered equipments and multiple uplinks.
  • Tier 2 = Tier 1 + Redundant capacity components.
  • Tier 1 = Non-redundant capacity components (single uplink and servers).
  • Tier 1: Guaranteeing 99.671% availability.
  • Tier 2: Guaranteeing 99.741% availability.
  • Tier 3: Guaranteeing 99.982% availability.
  • Tier 4: Guaranteeing 99.995% availability.

WORKLOAD CATAGORIES

  • MISSON CRITICAL – Your highest priority workloads with instantaneous recovery
  • BUSINESS CRITICAL – Your high priority workloads with prioritised failover
  • COMPLIANCE – Your compliant workloads that must meet regulations
  • GENERAL PURPOSE – Your non-critical workloads with a restart 

How-it-Works

BUSINESS IMPACT ANALISYS

…………………..

Define Budget

The following formula can be used to highlight the revenue lost due to a outage:

Lost of Revenue due to outage = $Revenue / 365 Days * (RTO + RPO)

The business case will always to to identify trade-offs, between, Price, Performance, Cost. You might not be able to achieve all of them and its you need to be realistic.

Triangle

Define Success Criteria

Developing a DR Solution, I would class as a Spaghetti problem and requires a method to solve this type of problem.

  • In ‘spaghetti situations’ in which everything is connected to everything, and everything influences everything it is by far not obvious what the best solution is. All people involved have a different idea about what the problem is. And if you ask them, all these people have different ideas about what the solution could;. If you, as engineer, consultant, manager or analyst are in a situation like this, then what to do?
  • In order to solve complex problems its required to define a overall strategy and method that will guide the development of the complex solutions and following a set critical path, budget and timeframe.
  • In order to design and accelerate the implementation of a solution, customers must commit to design decision and any acceptable risks. Multiply components solutions adds to the complexity, through validation and support from vendors is a absolute requirement and they must be invested in the success of the solution

Define Plan

  1. Planning your availability transformation
  2. Analyzing your current state
  3. Assessing your continuous availability readiness
  4. Identifying infrastructure requirements
  5. Designing the solution architecture
  6. Performing a cost/benefit analysis

DR Design Options

I wanted to explore a number of options for DR Design

  1. Active/Active
  2. Active/Hot DR
  3. Active/Warm Passive DR (Standby)
  4. Active/Cold DR Recovery from Disk
  5. DR to Cloud
  6. Azure
  7. Cloud Only

 Disaster Recovery vs Disaster Avoidance

Disaster Recovery Technology Options

  • Network
    • Standard IP LAN
    • Load Balancer
    • WAN Optimisation/QoS
  • Storage
    • Synchronous / Asynchronous
    • SnapMirror
    • SRDF
    • FlashCopy
  • Application
    • SQL Replication/Mirror
  • OS
    •  Veritas
    • Microsoft Always On Clustering
  • Hypervison
    • HA
    • FT
    • Zerto
    • EverRun
    • Manual
    • SRM
  • Data protection
    • Brick and Storage Level Backups
    • CommVault
    • Avamar/Datadomain
    • Veema
    • Symantec
    • Tivoli Storage Manager
  • Compute
    • IBM SystemP PowerHA
    • Stratus
    • HP Service Guard
  • Switch
  • Legacy 

Continuous Availability and Disaster Avoidance technology options

  • Network Virtualisation
    • Stretch vLAN – OVT / VPLS – Virtualization (OTV) can be used for L2 extension between the customer’s data center and the cloud. L2 connectivity allows customers to use the same IP from enterprise network in the cloud without the need to change for accessing workloads in the cloud after recovery.
      Storage Virtualisation
    • IBM SVC, EMC VPLEX, NETAPP V-SERIES – Synchronous / Asynchronous
  • Data protection
    • Storage  Level Backup from Site 2
  • Virtualisation
    • Software Defined Networking – DMZ, Firewalls L2-L7 All virtualised 
    • VMware Metro Storage Cluster

VMware Metro Storage cluster Requirements

OTHER NETWORK virtulisation OPTIONS

  • LAN Extension Deployment Scenarios
  • Ethernet Based Solutions
  • Cisco Nexus 7000 Series vPC (Virtual Port Channel) and Cisco Catalyst 6500 Series VSS (Virtual Switching System) for MAN distances
  • MPLS Based Solutions
  • EoMPLS (Ethernet over MPLS)
  • VPLS
  • A-VPLS
  • EVPN (Ethernet VPN)
  • P Based Solutions
  • Overlay Transport Virtualisation(OTV)
  • Fabric Solutions
  • Application Centric Infrastructure (ACI): VXLAN and Spine-Leaf architectures

Research

Hybrid Cloud and SDDC Conceptual Design

Hybrid Cloud  and SDDC Conceptual DesignWorkload Profiles

  • VDI
  • Database Applications
  • Responsive  Apps
  • General Purpose

Performance Matrix

(Identify where you workload range and design solutions)

  • Compute Low, Medium, High
  • Memory
  • IOPS
  • Storage
  • Networking

Workload Categories

(Networking should be 10 GB or 40 GB infiBand)

  • General Purpose
    • SSD Cache
    • SATA
  • Compute Optimised
    • Latest Intel
    • SSD
    • SAS
  • Memory Optimised – For large databases, SharePoint server farms, and high-throughput applications
    • MAX Memory
    • SSD
    • SAS
  • Storage Optimised
    • FlashIO
    • SSD
    • Large SAS
  • Dataware Warehouse / File Servers
    • Large SAS
    • SMB3/CIFS/NFS

Instances Types (Match Azure)

Networks

  • DMZ
  • PCI
  • WIBs
  • Server
  • Desktop
  • Management

Design Decision

  • 100% Microsoft Cloud, StoreSimple and SuperMicro Server Solutions
  • Hybrid Backup and DRaaS Veeam No SAN Solution
  • VMware vSphere 6
  • FileShares on Azure Steelhead
  • Cloud IaaS (Azure, AWS) are not mature enough for complex Enterprise Workloads and Networking
  • Build to move all workloads to cloud in 3-5 years (Azure or VMware Air)
  • Veeam DR Partner
  • Virtualise Firewall
    • Fortinet or Checkpoint ASA
  • BCP Site
  • Daily Backups
    • Veeam Backup Partner
    • Azure Stheelhead
  • Long Term 7 Year Retention
    • AWS Storage Gateway Announces Gateway-Virtual Tape Library (Gateway-VTL)
    • Microsoft Data Protection to Azure Backup
    • Steelhead Azure WAN Accelerators
  • Co-lo (Managed Firewall and Switch)
  • Use Megaport NaaS – http://www.megaport.com/ecosystem/connected-data-centres.html

Capacity uplifts Standard

What is required to maintain and monitor capacity and availability?

Requirements

  • Windows VSS Enablement

DC

  • NextDC / CloudPlus and Azure ExpressRoute for Veeam Backup and / DR
  • Equinix / Nexon and Azure ExpressRoute

BCP & DR

  • Cloud repositories – use the new Veeam Cloud Connect functionality in v8 to get backups offsite easily and efficiently
  • Offsite replica VMs – maintain VM replicas in the cloud for quick recovery in the event of a disaste
  • Backup-as-a-Service (BaaS) powered by Veeam
    Offload backup to a Veeam Cloud Provider, and free up IT resources for more strategic business-building activities. You continue to run your VMs onsite, while your Veeam Cloud Provider provides backup to meet your RTOs and RPOs using the #1 VM Backup, Veeam Backup & Replication.
  • Disaster Recovery-as-a-Service (DRaaS) powered by Veeam
    Use a Veeam Cloud Provider to replicate your VMs to the cloud for fast recovery in the event of a disaster. You continue to backup VMs as you do today, while your Veeam Cloud Provider provides an added measure of protection by replicating VMs offsite.
  • Note: With BaaS and DRaaS, backup and replication are managed by the service provider (it’s not Veeam Backup & Replication delivered in a Software-as-a-Service, or SaaS model.)
  • http://www.veeam.com/find-a-veeam-cloud-provider.html

Right Sizing Information Gathering

  • VMware Capacity Planner or Microsoft Assessment and Planning Tool
  • NetAp nSanity AutoSupport Reports
  • Veeam One (VMware change)
  • Core Switch Performance Obsivrium
  • MAX/Averages (CPU, Networking, IOPS, Memory)
  • Tape Size
  • Backup Frequency and Sizes
  • RVTools

Compute, Networking and Memory Requirements

  • Storage Total Size 20 TB
  • CPU Mark Total
  • Memory Total
  • Backup Delta / Hot and Cold
  • Growth
  • CIFS Storage
  • Mail
  • SQL DBs
  • Core Switch Bandwidth (Business hours)
    • Average
    • Min/Max

Backup Retention Requirements

No-SAN Issues

Components

Certified Hyper-V Compute and 10 GB Converged Networking

VM Templates

  • Small 1 vCPU, 2 GB memory, 50-GB disk
  • Med 2 vCPU, 4 GB memory,100-GB disk
  • Large 4 vCPU, 8 GB memory, 200-GB disk

Sizing Tools

NetApp Performance

  • COLLECT AUTOSUPPORT FILES

1. Browse to the file share \\FILER_NAME\ETC$\log\autosupport
2. Locate the most recent folder with name YYMMDDHHMM.0 or .1
3. Ensure you have selected the most recent folder with a *.0 or .1 name.
4. Folder should have at least 150 files in it.
5. Package/zip the entire contents of this folder to provide to TD

  • COLLECT PERFORMANCE STATS FILES

1. Browse to the file share \\FILER_NAME\ETC$\log\stats\archive
2. Each *.gz file represents 1 hour of data
3. By default, collect 50 *.gz files (approximately 2 days) of recent activity that represents the typical workload on your filer. You can extend this to 100 files to assess a longer time period if desired.
4. Note: for ONTAP release 7.x browse to \\FILER_NAME\ETC$\stats\archive
5. Zip the Autosupport package with the performance stats package into a single compressed file and send to TD
6. Alternatively, you can forward an autosupport email

Cost Items

  • Azure Costs
    • VPN and Express Connect
    • Backup VM Storage
  • Co-Location
    • Switch
    • FC Switch
    • Internet Connections
  • Design Items
    • VMware
      • vSAN
      • DMZ (NSX/Checkpoint)

Critical Path

  • Costs Solution
  • Engage Microsoft Consulting Services to Validate Design
  • Acquire Rack Spaces and Servers/Equipment
  • Design IP and Networking Layer
  • Configure Routers and Layer 2 Bridge and ExpressRoute and MegaPort
  • Design VM, Server and Hybrid Solution
  • Test Components
  • Migrate Workloads to Equinix (Nexon) Production
  • Migrate Workloads to NextDC DC and setup UAT/DEV
  • Consolidate Workloads to Hyper-V
  • Integrate BackupExcu Tape Recovery

Reference

Migrate to Cloud

  • Cloud Security Policy (SANs Controls + SLA + Monitoring + Encryption + Access + DR + Restore)
  • Azure Backup
  • Extend Network to Azure Virtual Network
  • Azure VMs
  • Azure Site Recovery
  • Microsoft Migration Accelerator
  • Azure Network Extension
  • Application Dependency Mapping
  • DEV/UAT
  • SQL DB
  • File Server
  • Exchange DB
  • Sharepoint
  • Azure Web Sites
  • Responsive Web Servers (Full Solution)
  • Active Directory/2-Factor
  • Managed Firewall
  • Managed Switch
  • Visual Studio Online
  • Sharepoint
  • Load Balancer
  • Docker Apps Apps
  • Develop Cloud Security Policy