Here’s the situation: You’re the CIO or similarly empowered representative of an organization. Different voices within your business are calling attention to the awesome scalability and power of hyperscale computing, which you’ve also noticed with increasing interest. Now the word comes down from on high that you’ve been tasked with designing and implementing your company’s hyperscale computing solution—whatever that should be.
Your organization already has an ambitious agenda in mind for whatever IT infrastructure you wind up choosing. The company is working with extremely high volumes of data and expects this situation to continue or grow. So, you’ve got a ton of assets earmarked for data storage, with many more workloads on the way.
Your business isn’t expecting this data to collect dust in a vault, either. Company leadership is already trumpeting new data processing applications and how smoothly already favored apps will be integrated into the new, high-performance system. Still others are wowed by the promise of artificial intelligence (AI) and automation that hyperscale data centers offer.
For these reasons, there’s a lot of positive support for going ahead and building a hyperscale data center, customized expressly for this business. However, that’s just one set of opinions.
Other leaders remind you that the company’s primary allegiance is to the bottom line and that your solution needs to be cost-effective and “thread the needle” by providing the most return for the least investment possible. These voices forcefully advocate using a colocation solution, where your company will instead rent space in a hyperscale data center, thus saving USD millions or even billions in construction costs and other associated charges.
Both options offer compelling arguments for their adoption. So, hyperscale vs. colocation—which do you choose?
When deciding between building hyperscale data centers or renting in colocation data centers, there are many variables for hyperscale customers to consider. To effectively weigh the decision, one must consider the total costs of ownership versus renting—plus a range of other, tangentially related issues.
Both data center options have their advantages and disadvantages. Both options deliver complex Software-as-a-Service (SaaS) solutions. Both are based on complicated digital infrastructures and depend on virtualization, the underlying concept of cloud computing.
To make the smartest decision possible, it’s important to first focus on each option, and check out their advantages and disadvantages.
Hyperscale data centers represent data storage on a gigantic scale. According to the Independent Data Council (IDC) definition of a hyperscale database, as reported by VIAVI Solutions (link resides outside ibm.com), to be considered a true hyperscale data center, it must contain at least 5,000 servers and occupy at least 10,000 square feet of physical space. There is no hard-and-fast guideline concerning energy usage, although most hyperscale data centers use somewhere between 100 megawatts (MWs) and 300 MWs.
It’s not an overstatement to say that creating an on-premises hyperscale data center from the ground up is a major endeavor—one that will require deep pockets and considerable effort. Even a simple listing of basic components provides a sobering idea of the project’s overall complexity and pricing:
Keep in mind that this list is in no way comprehensive and doesn’t represent the full costs associated with building a hyperscale data center. For example, it doesn’t include the primary asset needed for such activities: electricity. Nor does it mention the complicated and possibly expensive agreements that will need to be struck with local governments and communities. The list does, however, suggest how complex and multi-faceted this undertaking can be by showing that what’s being constructed is nothing less than a full-scale data factory.
Some organizations are interested in the power and potential of hyperscale computing but have no wish to build their own data center, especially once they see a breakdown of all associated costs.
The concern of such companies is understandable and legitimate; the sums of money required to operate in this space are not small. Depending on the facilities constructed, some use cases will require USD millions or even billions. Businesses building hyperscale data centers do have to maintain deep pockets—both when they construct the facility and then during its entire operation. A hyperscale data center should not be considered a one-time purchase.
Colocation, on the other hand, is simply a situation wherein one company owns a hyperscale data center and rents out its facilities, servers, bandwidth and/or space to interested businesses that presumably do not have their own data center facilities.
The obvious benefit for the company renting space in the off-site data center is that doing so saves it from having to make its own large-scale investment into a hyperscale data center. Viewed another way, what the company is, in fact, purchasing when it rents out data center space is flexibility. By opting for colocation, the company can delay the significant investment in a hyperscale data center until a later time, keeping its cash reserves right where they are.
There’s even a specific term for when smaller companies make a gradual entry into this market— retail colocation, which allows organizations to start slowly. Typically, this plays out like this: The company sets up servers within a colocated environment and starts getting the feel for the process. Then the business begins experimenting with different cloud deployments, trying on different configurations and checking them for fit.
So, colocation (and retail colocation) allows businesses to enjoy the benefits of hyperscale, without the major investment required to make it happen. On the other hand, nobody’s going to let a company use their hyperscale data centers for free, as anyone trying to negotiate such an arrangement soon learns. Further, since you’re renting these facilities, you should probably accept the fact that a colocated data center will not provide the same perfect “fit” as custom-built hyperscale data centers.
Regardless of whether you choose to build a hyperscale data center or rent space through a colocated data center, organizations usually need to designate someone as an integrator to shepherd the project across all possible hurdles and into completion.
For the purposes of this blog post, the integrator is you.
It’s interesting that at such a “modern” time as this, when whole new vistas of computing are opening up, that some people’s attitudes are still trapped in old ways of thinking. For example, when you see the phrase “buy or rent,” certain longtime assumptions can still come to mind:
While the first two statements contain some measure of truth, the only statement that’s fully true is the final bullet point. Renting options, it turns out, are indeed perfectly designed for people whose work keeps them in constant motion and relocating to different areas.
So, there’s validity to the third bullet point. The first two contain various levels of validity.
Let’s take the second bullet point first. While it is true that most people and companies can afford the purchases they make, you can’t always bank on it. We’ve seen example after example of companies, families and individuals who were supposedly wealthy—until it was suddenly discovered that they weren’t. Bottom line: Without seeing a verified financial balance sheet, there’s really no way to know what’s going on with a company or a person and who can afford what.
But it’s the first bullet point that can be most misleading because it could definitely be a valid business strategy for a company to avoid building hyperscale data centers and instead rent colocation facilities and services. Businesses have to walk a complicated financial line and part of that is keeping enough cash in reserves. Hyperscale data centers can cost USD millions or even billions to build and operate, quickly turning a thriving company cash-poor because of the massive commitment it has made to this technology.
Prepare to be surprised, because some of the world’s biggest users of wholesale colocation services are actually Amazon (AWS), Google and Microsoft. Actually, each of the “Big Three” data center providers either currently rent out some of their space from other colocation providers or have done so in the past. For example, members of the “Big Three” have leased significant data center space from a company called Equinix, which owns and runs 260 data centers in 71 markets. Then, these cloud service providers turn around and rent out their newly acquired space to users.
That this practice occurs should surprise no one; most economies function using mid-level distributors that are instrumental in moving goods or services from Point A to Point B. Still, based on their collective clout in this market, it is intriguing that some of the world’s largest providers of data center infrastructure are also some of its biggest customers.
Not that there’s any shortage of external data center customers. The market is sizzling hot right now and has been for a while now. For example, Synergy Research Group reported in April 2023 that the first quarter of that year saw a USD 10 billion jump over Q1 2022 spending on data centers (link resides outside ibm.com).
Most recently, there has been a perceptible shifting of corporate assets among giant companies to enable their concentration on establishing data centers. As real estate analyst CoStar (link resides outside ibm.com) reported in October 2023, “While Microsoft and other tech giants such as Google, Meta and Amazon have made deep cuts to their once-vast office portfolios, they have increased spending on development of data centers.”
For companies who are strictly compelled by the bottom line, the matter routinely comes down to the simple comparison: Which is the cheaper alternative? But divining the answer to that rather complicated question usually involves a number of areas of comparison, some involving tangible quantities but others based on various intangibles that usually come into play and which should be considered:
However, by this point, you will have surely realized that when it comes to data centers of this magnitude, a bottom-line comparison doesn’t tell the entire story. There are many questions and considerations to entertain. Before any organization selects either of these solutions, it would be wise for it to engage in a bit of serious soul-searching about what kind of company it seeks to be, both now and in the future:
As you’ve probably already figured out, there are few quick or easy answers when it comes to deciding between building a hyperscale data center or renting space from a colocated data center. There are simply way too many issues at work to make a casual decision one way or the other.
It might even be such a weighty decision that you need more than one person to help make the call. If so, the integrator may need additional staff to help divide up and tackle the different decision areas that require extra study. For example, you may need to assign a staff member to handle any ecological impact statements that will need to be drafted and submitted before a building site can be legally cleared for construction.
As for parting advice, make sure you do your homework in full. The stakes of this decision are very high indeed. Not only are we discussing vast sums of money, we’re also talking about issues that have a direct and perhaps lasting bearing on an organization’s fiscal health. Therefore, when feasible, get the freshest data that exists. Create contingency plans defined according to data-center-performance levels. You must be thorough in your thinking and cover every angle possible. That’s how you’ll draft a blueprint for success—no matter which type of system you choose.
Solutions like IBM Storage Scale help you grow a global data platform and both define and refine your organization’s data strategy approach.
Explore IBM’s roadmap for creating an open quantum software ecosystem. Discover how quantum technologies are poised to reshape industries and accelerate innovation with frictionless quantum development.
Explore the vast potential of supercomputing, where massive computational power meets real-world applications. Learn how supercomputers drive breakthroughs in AI, scientific research and large-scale simulations.
Learn how cloud solutions revolutionize high performance computing (HPC) with enhanced agility, flexibility and security. Discover real-world examples of how businesses are using IBM Cloud HPC to tackle complex workloads while staying scalable and cost-effective.
Explore how the convergence of high performance computing (HPC) and artificial intelligence (AI) is driving industry breakthroughs. Learn about real-world use cases.
Explore how Cadence uses IBM Cloud HPC to enhance chip and system design, delivering faster, more efficient solutions.
Learn how parallel computing revolutionizes data processing, delivering faster results for complex tasks and driving enterprise growth.
Optimize your HPC environment with IBM FlashSystem. Handle massive amounts of data efficiently with scalable, high-performance storage designed for compute-heavy workloads.
IBM Power is a family of servers that are based on IBM Power processors and are capable of running IBM AIX, IBM i and Linux.
IBM Spectrum LSF Suites is a workload management platform and job scheduler for distributed high performance computing (HPC).