Skip to main content

Earnings Call

Cerebras Systems Inc. (CBRS)

Earnings Call 2026-03-31 For: 2026-03-31
Added on June 24, 2026

Earnings Call Transcript - CBRS Q1 FY2026

Operator

Good afternoon, and welcome to Cerebris Systems' first quarter fiscal year 2026 earnings conference call. Currently, all participants are in a listen-only mode. Following management's prepared remarks, we will open the call for questions. Please note that today's call is being recorded. I will now turn the call over to Sean Dorsey, Head of Investor Relations.

Bob Komen, CFO

Please go ahead.

Sean Dorsey, Head of Investor Relations

Thank you, Operator. Good afternoon, everyone, and welcome to Cerebra Systems' first earnings call as a public company. Earlier today, we issued our press release and posted our supplemental earnings presentation to the Investor Relations section of our website. A replay of this webcast will also be available on our Investor Relations website following the call. Joining me today are Andrew Feldman, our co-founder, chief executive, and president, and Bob Komen, our Chief Financial Officer. Before we begin, I would like to remind everyone that today's discussion will include forward-looking statements under the safe harbor of the Private Securities Litigation Reform Act of 1995. These statements include, but are not limited to, statements regarding our future financial performance, business strategy, market opportunity, customer demand, product roadmap, technology leadership, supply chain, operating model, and outlook for Q2 and full year 2026. Forward-looking statements are based on current expectations and assumptions and are subject to risks and uncertainties that could cause actual results to differ materially from those expressed or implied. These risks are described in our SEC filings, including our final prospectus related to our IPO and our future periodic filings with the SEC. We undertake no obligation to update these forward-looking statements except as required by law. During today's call, we will also discuss certain non-GAAP financial measures. Reconciliations between GAAP and non-GAAP results are included in today's press release and supplemental materials, which are available on the Investor Relations page of our website. With that, I'll turn the call over to Andrew.

Andrew Feldman, CEO

Thank you, Sean, and thank you everyone for joining us today. This has been an extraordinary several months, and I want to begin by thanking our customers, partners, suppliers, employees, and shareholders. We would not be here without your trust and your support. Earlier today, we posted our q1 2026 results and we delivered a strong quarter we delivered core revenue of 191.3 million up 92 percent year over year core hardware revenue contributed 111.6 million up 60 percent year over year while core cloud and services revenue contributed 79.8 million up 167 percent year over year. Bob will share more color on our financial results shortly. Before Bob digs into that, I'd like to say a few things about the market. I'll divide my comments into several sections. I'll begin by spending a few minutes sharing my views on the larger drivers underpinning the AI revolution, their impact on the compute market, and why speed wins. I'll then turn to our successes in Q1 with special attention to our progress with OpenAI and AWS. And finally, I will talk about how we expect to avoid many of the supply chain challenges that bedevil others in our space. To understand the dynamics in the compute market, it's important to realize that AI provides new capabilities to computers. AI gives computers purchase on whole swaths of the world that had previously been foreclosed. This is why AI is so transformative, and why its impact is so profound, and why we believe it increases the size of the market addressable to compute by many thousands of times. Computers have historically been good at math, very good, but they were relatively poor at everything else. They did not provide much insight into text or images. For these modalities, all they could do is store and retrieve. Computers were at their best in a 2D world of numbers. In a real world of three dimensions, they were challenged. AI opens up the world of human experience to computers. As a result, the size of the market increases exponentially. It is as if If prior to AI, computers worked in black and white, and in two dimensions, and after AI, they address a world of color in many dimensions. This is why AI has spurred an explosion in the demand for compute. Computers can now do things they have never done before, and why, in our opinion, demand will continue to accelerate for many years to come. Text, images, video, agents, robotics, these are all part of how AI expands the computer's ability to understand, participate, and take actions in the world. These all represent opportunities for Cerebris. Let's look at the specifics of how this is unfolding. Prior to 2025, AI was a parlor trick, a novelty. Interesting, but not useful. Cool, but not valuable. AI is now valuable because it has become profoundly useful. Led by OpenAI, the foundation model providers pioneered the way. The foundation model makers, and shortly thereafter the open source models, made models smart enough to be useful across many domains. And once something is useful, people use it. And once people start using a technology, speed determines its productivity. Fast is productive, and slow is unproductive. Speed provides answers in less time, providing competitive advantage. Speed makes the largest and smartest frontier models interactive. Speed enables agents to complete tasks faster. fast tokens are the most valuable tokens because they get more work done in less time and today cerebrus delivers the fastest ai in the world bar none not by a little bit but by an order of magnitude and we do this for small models for medium models and for the largest models in the industry. We do this for models with small kv cache, with medium kv cache, and with giant kv caches. We generate tokens faster than anyone else. What I'd like to show you right now is a quick demo just of how much faster we are than GPUs on Kimi K2, a trillion parameter open source

Bob Komen, CFO

model. We're going to run the exact same prompt. On the left, it's Cerebrus. On the right,

Andrew Feldman, CEO

it's a leading GPU. The only difference, same model, we're finished already, same model,

Bob Komen, CFO

same prompt, the difference is hardware, and we're finished. It took us 21 seconds.

Andrew Feldman, CEO

We're now waiting on the GPU. Still waiting. Now we've increased this speed 5x in the video to not make you wait as long as you otherwise would. Still waiting. Okay, what Sreebers did in 21 seconds, it took 4 minutes and 37 seconds for the GPU to do. The same model, the same prompt, That's what it means to be 13 times faster. In AI, inference speed is productivity. Slow isn't productive. But this should not come as a surprise. It is in line with each of our everyday experience. How big is the market for slow search? How big is the market for slow internet access? Any of you still use dial-up? How long will you wait for a website to resolve? why would it be different for AI? In fact, not only does speed increase the value of tokens, but speed accelerates the adoption of AI. When AI is fast, it's more fun to use. People use it. They use it more often for more things, and they use it to solve more important problems. With fast AI, users invent things that never existed before. They solve problems in new ways. They develop new offerings, new business models. This is what speed does. And this is what Cerebers' speed enables. A final point on speed, there recently has been a great deal of focus, especially at the frontier model level, on safety and the importance of guardrails. How do guardrails work? Guardrails add a layer of compute on top of the AI to create a safer experience. This compute takes time, and it takes more time on slow infrastructure. Traditionally, guardrails force the trade-off between safety and user experience, between safe and fast. Cerebris eliminates this trade-off. Fast AI inference allows guardrails to work without inserting crippling delays. AI is safer with these guardrails, and AI is safer and more productive when it's blisteringly fast. Our performance advantage is born of our wafer scale architecture. We're more than an order of magnitude faster than GPUs because we solve problems that haven't been solved or couldn't be solved by others. The problems of yield, cross-radical connectivity, mismatches in thermal expansion, power delivery, and cooling are all problems that the industry struggles with but that Cerebra solved years ago. Moreover, the advantages of wafer scale are durable. By building chips that are 58 times larger than the largest competitor, we're able to use SRAM and benefits from its blistering speed, while competitive offerings use HBM, which is slow, expensive, and in short supply. We see the advantage of wafer scale technology expanding our performance lead as we bring next generation solutions to market. In fact, the technology underpinning of wafer scale fundamentally advantages additional technologies in the future. For example, wafer scale technology brings profound advantage to memory stacking and optical integration. And as we look further into the future, data centers in space are also advantaged by wafer scale integration. Not only does wafer scale compute deliver faster speeds and for latency sensitive workloads, less power per unit compute than do GPUs. But most importantly, it requires less chip to chip communication. And chip to chip communication is one of the fundamental limitations of terrestrial data centers and a yet to be solved problems for data centers in space. So with this as a backdrop, in the first quarter of 2026, how did we meet this extraordinary market, and how do we leave Q1 even better positioned? In this section, I'll focus on our partnership with OpenAI and AWS as they took shape in this quarter. We signed a definitive agreement with OpenAI on December 24, 2025, for the purchase of more than $20 billion of Cerebris Compute over the next several years. By February 1st, we were in production, running a model we'd never before seen. 35 days from signature to production deployment. Beyond the transformative revenue ramifications, our collaboration with OpenAI gives us a direct view into frontier model development and the direction it is moving. By pairing frontier model intelligence with the world's fastest inference, we build products and technologies that others simply can't. In fact, the boundaries of these capabilities have yet to be fully explored. OpenAI and Cerebrus are excited that GPT 5.4 is now running on Cerebrus. This collaboration brings together OpenAI's frontier models with Cerebrus's wafer-scale inference infrastructure to enable highly responsive model interactions. GPT 5.4 on Cerebrus is currently available to OpenAI engineers and to select OpenAI customers as part of OpenAI's strategic rollout. OpenAI and Cerebrus are also actively working to bring GPT-5.5 onto Cerebrus as part of the next phase of this rollout and expect to share more shortly. In March, continuing this trend, we signed a binding term sheet with AWS to deploy Cerebrus and AWS data centers. Our solutions will combine AWS's leading Tranium-3 chips with Cerebrus's CS3 in a disaggregated solution that is expected to be an order of magnitude faster. Tranium will do pre-fill, and Cerebrus will be decoded, and together the solution is expected to deliver the fastest tokens at massive throughput. Remember, disaggregated solutions are a significant opportunity for cerebris. The technical strategy is one of divide and conquer. It is based on the recognition that inference has two computational components. The first is where we process the prompt. This is called pre-fill and is highly paralyzable. The second is where we generate the response. This is called decode and is strictly sequential. By using different processors for the pre-fill and for the decode, we can deliver truly exceptional results. We are also proud to announce that we have, as of this week, completed a definitive agreement with AWS and will begin our technical collaboration as well as prepare for deployments in their data centers. As you all know, AWS is a leading cloud compute company and one of the most important providers in the world for developers and enterprises. And many enterprises want to run AI where they store their data and where they have existing agreements and where the environment is familiar and is secure. As a result, AWS provides an easy way for Cerebra solutions to meet the world's enterprises where they already are. Let's for a minute now turn to supply chain. Keeping up with this extraordinary market growth has brought supply chain challenges to many in our industry. At Cerebris, we have several fundamental advantages. First, the binding constraint in the market right now is HBM memory. It's in short supply, it's expensive, and we don't use it. So we avoid this constraint entirely. We use SRAM. And SRAM is printed on our logic wafer. It's not a separate chip. As long as you can make the chip, you can make SRAM. Its supply is approximately infinite. The second binding constraint is the COOS process at TSMC. We don't use it. So again, we sidestep this constraint. Third, the 3 nanometer capacity at TSMC is a constraint. And again, we don't use it. We're the fastest in the world and happily at the 5 nanometer node where there is less contention for fab resources and where manufacturing is less expensive. Our partnership with TSMC deserves special mention, as they know more about chip making than just about anyone else on Earth. They believed in the wafer scale approach from the time we were a tiny team with nothing but a PowerPoint slide, and they've been with us along the way. They have proven themselves to be an extraordinary partner. Just as a reminder, our saleable unit is not our wafer, but our CS3 system. We sell the CS3 for on-premise deployments or time on the CS3 through our Cerebrus cloud or through our partner's cloud. We manufacture our CS3s in the U.S., and in fact, to the best of my knowledge, we are the only accelerator maker to manufacture exclusively in the U.S. We have added hundreds of thousands of square feet of manufacturing and clean room space to support our growth. We've expanded our partnership with Flextronics and are proud to have added Sanmina as our second major contract manufacturer to assist us in managing our expansion. Finally, it's no secret the data center capacity is at a premium. It's a dogfight out there. Despite this, we've added data centers around the world. We've added data centers across the U.S. and Canada, Europe, including France and the Nordics, and we're in early discussions for data centers in Israel, the UAE, Australia, Singapore, India, and Indonesia. We're expanding the capacity we need to serve customers, and we're doing it with urgency. The demand environment is strong, but this is not just about demand. It is about building the infrastructure required for the next phase of AI. So to wrap up, there is a tectonic shift in compute demand brought about by AI's ability to make the world around us tractable for computers. As a result, the market will need vastly more compute, in my view, for decades. AI power users represent today a tiny fraction of the world's population, by some estimates less than 1%. and compute and memory is already in tight supply just imagine to this ai revolution we bring leadership technology which in turn enables us to deliver the fastest ai imprints in the world by more than an order of magnitude fast tokens are more valuable tokens and cerebris tokens are the fastest the result was a record quarter with that i'll turn things over to bob and he can provide more color on the financial results. Bob? Thank you, Andrew, and good afternoon, everyone.

Bob Komen, CFO

I want to also add my thanks to our customers, partners, Team Cerebris, and the investment

Bob Komen, CFO

community, both new and who have gotten to know us over the last several years.

Bob Komen, CFO

Cerebris is more than 10 years into the journey, and we're still just at the very beginning. I want to thank everyone for joining us today on our first earnings call, operating as a public company. The opportunities we see ahead for us with FastAI are massive and we appreciate everyone who has chosen to join us for the road ahead. Today I want to describe the financial framework we will use to discuss our results. It's the same way that we evaluate our financial performance and make resource allocation decisions internally. It provides additional visibility to amounts that are embedded in our reported gap revenue and cost of revenue that we believe provide more transparency as well as direct comparability to our prior historical results to better analyze our trends. Beginning in Q1-26, we have data center costs, which our contract with OpenAI has us pass through to them with a 3% markup. These data center pass-through items are reported gross, so they increase both our cloud and other services revenue and cost of services, but are at a significantly lower margin than the rest of our business. These amounts start out small in Q1, but they'll become more significant over time. Also, OpenAI has the option to choose whether to receive its future committed amounts in our cloud or in its own data centers, which would mean there would be no future corresponding pass-through amounts for that capacity because these amounts can be highly variable and are outside of our control we're excluding them from our core business metrics we also now have non-cash amortization of customer warrants that is recorded as a reduction in revenue for both our hardware and cloud and other services gap revenue line items depending on the related services the customers purchasing. So we're adjusting our GAAP numbers to exclude the impact of these items and a few other common ones like stock-based compensation and one-time items, and we define the resulting non-GAAP amounts as our core business metrics. I will only be discussing these core metrics today. Reconciliations to GAAP for all of our non-GAAP items are available in today's earning material and on our website. I'd like to start with revenues. Q1 was another record quarter for Cerebris. Our core total revenue was $191.3 million, representing 92% year-over-year growth. Now looking at revenue by type, core cloud and other services revenue reached 79.8 million and grew 167% year-over-year. Market demand for Cerebrus Inference Cloud remains incredibly strong. We are ramping our capacity rapidly, and we saw a meaningful pickup in revenue across Q1 as we began our ramp with OpenAI in February, as well as from other customers using the Cerebrus Cloud. We expect increasing year-over-year growth rates for each quarter in 2026, with more of this revenue coming later in the year as the ramp in our cloud capacity deployments accelerates. Core hardware revenue was $111.6 million, up 60% year-over-year. We plan to see decreasing hardware revenue for the next few quarters as our existing POs are delivered and our mix shifts towards the majority of our hardware production being deployed in Cerebris Cloud to fulfill our significant contracts. This trend could change relatively quickly, however, as OpenAI and AWS, as well as other customers, make decisions about when and how they prefer to deploy our hardware solutions in our data centers or theirs. Now moving on to gross margin. Core gross margin was 46.5% in the quarter, compared to 42.1% in the prior year period and 41% last quarter. Core cloud and services margin improved significantly to 52.9% in the quarter from lower levels we saw last year as we launched the Cerebris cloud service. The primary reasons for the increase were higher pricing as the market is now valuing higher speed inference at a premium and market demand exceeds supply. the utilization of our systems that we began to deploy in late 2025 improved quickly and there was a small amount of rent backs relatively speaking to increase capacity from a customer for the rest of 2026 in order to accelerate our ability to service the significant near-term demand in our contracted backlog we've chosen to make more capacity available sooner by temporarily renting our own systems back from an existing customer while we aggressively build out and deploy our own data center capacity. The additional cost of renting third-party capacity will depress core cloud and other services margin temporarily from current levels. We expect the impact to be a decrease of 10 to 15 margin points based on the volumes we are now anticipating before beginning to ramp back towards our target margin of 60% plus as we transition away from our rented systems. Core hardware margin was 42% compared to 30.6% in Q125. Over the last few quarters, we benefited from the timing of incremental performance-based incentive pricing after the target was achieved, but was recognized prospectively for the remaining systems that had not yet been shipped. We expect core hardware margin to be more similar to the first half of 2025 and return to the low 30s as this contract pricing normalizes. As a reminder, when we sell hardware systems and recognize that revenue up front, we also include support and other services which have significantly higher margins. As a result, total profitability over the life of the individual contracts is much closer to our target overall gross margin. These additional elements of revenue are required to be recognized over the contracted life of the services and are recorded as core cloud and other services, so are not included in our core hardware revenue and gross margin. We are focused on improving gross margin over time through scale economies, improved product throughput and performance, manufacturing efficiency, utilization of cloud capacity, and performance-driven pricing improvements. to achieve our long-term overall gross margin target of 60 percent at the same time we will continue to be aggressive and creative including potentially investing ahead of demand when we see attractive long-term opportunities to gain key customers accelerate revenues and drive gains in market share now i'm going to talk about operating expenses our non-gap operating expenses were 92.6 million up 51 percent from a year ago at just more than half the rate of core revenue growth of 92 demonstrating the strong operating leverage available as we grow our business r d was our largest area of investment at 69.8 million we believe sustained r d investment is essential to maintaining our technology leadership and requires being at the frontier of ai across silicon systems software models and cloud infrastructure to deliver the fastest performance we have an exciting product roadmap to bring to market over the next several years including near-term innovations such as the implementation of disaggregated inference solutions with multiple hardware partners which we expect to begin to deliver in the second half of this year Sales and marketing expense was $12.9 million, reflecting continued investment in customer engagement, field capacity, developer adoption, and go-to-market infrastructure to support increasing market demand. G&A expense was $9.9 million and will continue to step up significantly next quarter due to incremental costs associated with operating as a public company and rapid growth in the size of the business. Moving on to profitability, core non-GAAP operating loss improved to near break even at minus $3.5 million with operating margin of negative 2%, a significant improvement from a year ago when core operating loss was 19 minus 19.3 million and operating margin was negative 19 percent. There was also a nice improvement sequentially from Q425 when operating margin was minus 10 percent. Core non-GAAP net loss was 2.5 million. While the temporary reduction in gross margin I described earlier that will result from renting back our systems until we deploy significant capacity in our own data centers will cause these metrics to regress somewhat for the next few quarters. We believe the steady improvement that we delivered over the past several quarters highlights our ability to achieve our target profitability profile of approximately 60% gross margin and 40% operating margin in the medium to long term. Moving on to our current cash position, we ended the quarter with $3.3 billion in cash, cash equivalents, restricted cash, and marketable securities. We've accelerated the pace of our fundraising over the last several quarters to support our increasing growth rate and provide us with the liquidity we need to scale. As a reminder, we raised $1 billion in Series G equity in September 2025, another $1 billion in Series H equity in February 2026, added a revolving credit facility for up to $850 million in April 2026, and then just a few weeks ago completed the largest semiconductor IPO in history, raising another $6.4 billion. We are well positioned with the financial flexibility to accelerate the sourcing and deployment of data centers and our supply chain to support significant near-term growth of our cloud business. Now turning to our outlook, we'll typically provide quarterly guidance, but since this is our first earnings call, we'll also provide some color on the year. In our core business in Q2, we expect core revenue of approximately $194 million, representing year-over-year growth of 88%. Core gross margin in the range of 36% to 38%. Core operating margin in the range of minus 30% to minus 32%. And for the full year 2026, we currently project core revenue in the range of $855 to $865 million, representing year-over-year growth of 69% at the midpoint. Core gross margin in the range of 38% to 41%, and core operating margin in the range of minus 28% to minus 32%. In summary, we made significant progress in our business during the first quarter. We delivered strong revenue growth, gross margin improvement, and meaningful customer momentum. We significantly strengthened our balance sheet through our IPO and our fundraising activities, and we're poised to continue executing on the enormous amount of opportunity we see. We're working hard to bring more data center capacity online as soon as possible to meet robust demand. With that, I'll turn the call back to Andrew for closing remarks.

Andrew Feldman, CEO

Thank you, Bob. Cerebrus was founded on the belief that AI infrastructure needed a new approach, one that was built from a clean sheet. The progress we report today reinforces this belief. The world needs faster AI. Faster AI, like faster versions of all technologies before it, drive adoption, usage, and customer experience. When given the choice, who wants slow? And we're built to deliver fast AI. That's what we do. As AI continues to expand its footprint, so will we. We're proud to be a public company and we're redoubling our effort on the work ahead. We continue to fuel our culture with fearless engineering and with the ability to delight our customers with experiences that are unavailable elsewhere. We also will work diligently to communicate with our stakeholders and our investors and to do so with transparency and with discipline. We thank you for joining us today. Operator, please open the line for

Operator

questions. Thank you. As a reminder, to ask a question, you will need to press star 11 on your telephone. To remove yourself from the queue, you may press star 11 again. Please limit yourself to one question and one follow-up to allow everyone the opportunity to participate. Please stand by while we compile the Q&A roster. Our first question comes from the line of Timothy Arquiri of UBS. Please go ahead, Timothy.

Timothy Arquiri, Analyst — UBS

Thanks a lot. Andrew, now that you have the definitive agreement with AWS, can you sort of help us to think about the timing on this and your ability to supply that customer? I know you had to put in your wafer orders back in February. So can you just give us a little bit of help in terms of when you can start to ship to them?

Andrew Feldman, CEO

Sure. I think TSMC has been extremely good to us. We are in the happy position of having supply for our plan and beyond in 2026. I think you should expect to see AWS's impact in 2027.

Timothy Arquiri, Analyst — UBS

Got it. And then if I could ask a quick follow-up. I also heard, Andrew, you talked about multiple partners for disaggregating solutions um does this imply that there's another customer beyond aws and i guess i asked because i did see that cerebrows had a press i had a you know presence at microsoft build so i'm just wondering what you mean by the multiple partners thanks i think the opportunity

Andrew Feldman, CEO

to provide uh decode for people who have uh gpus is is real and in front of us i i think that's exciting i i think that uh the gpu as an architecture struggles with the sequential nature of decode and we we are extraordinary at it so it makes sense uh to explore partnerships on on that

Operator

vector thank you our next question comes from the line of tom o'malley of barclays your line is open

Tom O’Malley, Analyst — Barclays

tom thanks guys for taking my question and congrats on the next results andrew i wanted to ask you a question on your TAM. I think that during the process, there was a lot of conversation about your ability to handle larger models. When you look at Kimmy, that's one example of a large model. You're again showing a demonstration today about attacking larger models as well. Jensen spent time talking about 25% of the inferencing market is fast inferencing, and maybe even took a step back from that on the last call. But what do you think your TAM is when you look at the broader AI market? Would love to get your opinion there.

Andrew Feldman, CEO

thanks for the question we we look out into technologies and can't find examples of where slow has owned meaningful portions of the market over medium periods of time and i i think you should think very carefully about the example of search right there is no slow search because nobody wants it right there's no more dial-up because nobody wants it and i i think uh when given the choice on the same model between fast and slow i i don't think uh it's a very hard decision and so when we look out at the space uh we see uh the entire inference market as available to us for for fast inference i mean who who doesn't want answers in less time and who doesn't want more productive agents so that's what we see i i know that's at odds with with gpu makers um and both of our our arguments are i think in some way self-interested We build fast, and I think the market's big for fast, so I'm not surprised at that.

Tom O’Malley, Analyst — Barclays

Super helpful, and then we might find this out in the filings, but just wanted to give it a crack on the call. Did you have any top 10 percent customers, and are you willing to share on the call how large they were? Thank you. I don't think we should share on the

Andrew Feldman, CEO

call. I think you'll see in the filings. Thanks, guys. Congrats on the results.

Bob Komen, CFO

Appreciate it. Thank you.

Operator

Our next question comes from the line of Quinn Bolton of Needham & Company. Your line is open, Quinn.

Quinn Bolton, Analyst — Needham & Company

Beth, Andrew, Bob, congratulations on your first call as a public company. Andrew, I wanted to follow up on the inference TAM question. Obviously, you guys are addressing the fast inference portion of the market, which you think allows you to address the entire market, but your tokens may be more expensive. So I'm just wondering if you could address the higher token cost for fast inference. how much of the market do you think is willing to pay a premium for fast inference? And then I've

Andrew Feldman, CEO

got a follow-up on the roadmap. I think today, in many instances, fast is priced at a premium. I think you saw Anthropics offer a service. In fact, most now offer services in which fast tokens are sold at a premium. I think they're sold at a premium because they're more valuable right and I think you can look to your own experience with with your internet provider if dial-up were free do you want it I think the answer there is quite the contrary you'd have to pay quite a bit of money to get someone to take dial-up and so I think that the reason right now that there's a premium is is because people prefer fast it's more valuable

Quinn Bolton, Analyst — Needham & Company

i think we'll see over time how that shakes out and then the question definitive agreement now signed you know if you look across the compute spectrum oftentimes these ai compute deals can can you know extend into the gigawatt range just wondering can you give us any sense of the scale? Is this tens of megawatts, hundreds of megawatts? Could it reach a gigawatt? Any sense on the size of the AWS partnership and definitive agreement? I don't think we're sharing that

Operator

at this time. Sure enough. Thank you. Our next question comes from the line of Atif Malik of

Atif Malik, Analyst — Citi

Citi. Your line is open, Atif. Thank you for taking my questions and congratulations on the debut. Andrew, on the OpenAI and AWS partnerships, what is the decision tree for them to take the future commitments in cloud or as hardware and data centers? So, first, greetings, Atik. Good

Andrew Feldman, CEO

to hear from you again. Second, with AWS, they are deployed in AWS data centers. That's the deal. I think open AI has a choice they can deploy it in their data centers in a model where they buy the hardware or they can receive the compute via cloud service I think it will depend on open AI's sort of a portfolio decision of their data centers and their various capacity versus what we can bring in data centers? I think that's likely to be the determining factor, but I think that's really an important question for them.

Atif Malik, Analyst — Citi

Got it. And Bob, as a follow-up, Andrew talked about the dogfight in terms of data centers and power availability and whatnot. When you look at your full-year outlook, and thank you for providing that on this call. How much of that is new data centers or new power shells versus renting back from your existing G42 customer or your 3Biz cloud? This is Andrew. We're trying to add data

Andrew Feldman, CEO

center space as fast as we can. We're engaged with builders throughout North America, data center operators in in europe in the middle east um we have new data centers coming on board uh in q3 q4 q1 q2 q3 q4 of next year and are are adding more we're in discussions with literally dozens of different data center owner operators uh and so i i think the answer is all of the above we are going to the demand for our product right now is so significant we are seeking data center capacity around the world as quickly as we can thank you our next question comes from

Operator

the line of joe moore of morgan stanley your line is open joe yeah thank you um on the same lines

Andrew Feldman, CEO

is the last question. Is the constraint on your growth five nanometer wafer capacity? Is it space and power and the kind of build out of your cloud? Are there some other constraints that we should be thinking

Bob Komen, CFO

of? It feels like demand is not the constraint here. It's how quickly you can ramp.

Andrew Feldman, CEO

Demand is not the constraint. Supply is not the constraint. The constraint is data center. That's helpful. To the extent that your gross margins are better than we had modeled is that a function of sort of a quicker ramp of that internal capacity versus the g42 rental or just you know what are the dynamics of gross margin through the the rest of

Bob Komen, CFO

this year uh thanks thanks joe there's there's a few things going on um one is actually higher pricing um so we because there's tremendous demand we've been able to see higher pricing from existing customers so even as open ai is starting to ramp that's been an upside to our gross margin profile and something that we're reflecting now in the outlook for the rest of the year you know another way to think about it is the competition has also increased in price they have higher costs for hbm and other things so i think the floor in the marketplace has come up a bit and then we've we've been able to look at the timing of the amount of capacity that we need to bring on and the economics around it which we were estimating a couple of quarters ago and that's also turned out to be a bit more favorable both in terms of how much is coming on when and and also that the amount that we're paying. So I think all of those, all of those factors as they play out for the rest of the year will allow us to be at higher gross margins than what we had, had predicted

Operator

at the beginning. Thank you. Our next question comes from the line of Joshua Bouchater of TD

Joshua Buchalter, Analyst — TD Cowen

Cohen. Your line is open, Joshua. Hey guys, thanks for taking my question and welcome to the fun world of earnings calls. Sorry to keep pulling at this thread. I wanted to follow up on Tom and Quinn's earlier questions about the ability to service some of the larger models. Maybe using the demo that you guys provided of supporting the trillion parameter Kimi model, any details you can give on the specs that were in that benchmark you showed, like how many CS3s were used to support Kimi and maybe what the competing GPU-based rack

Andrew Feldman, CEO

architecture was? Thank you. We used the leading, by way of comparison, we used a leading inference cloud, so we tried to do our best to compare top of tree to top tree my understanding is that they're using B300 to serve as an endpoint for this model but I can double check that for you I think there is a fundamental misunderstanding propagated by some analysts who just didn't understand that our architecture was perfectly suited for these models of large size small size medium the big caches small caches and that that we can do them and are doing them not just in this demo but for open AI at frontier models right There are only two hardware vendors that currently serve OpenAI models. We're one of them. And so it is sort of a proof point, right? An empirical validation that big models work just fine on us, and we have the same advantage as small models.

Joshua Buchalter, Analyst — TD Cowen

Okay, I understand. Thank you for the detail there. And then maybe for Bob, as we think about the annual guide you gave, I think it implies sort of 20% plus half over half growth. Any help you can give us on how much of the second half growth is from pricing or maybe open AI contribution that we should expect for that first, as you build up to the first 250 megawatt build? Thank you. Yeah. Look, I think this initial guide coming

Bob Komen, CFO

out in the in which is really focused on the first quarter and looking forward for the rest of the year where we have data centers coming on largely in the back end of the year a lot of the a lot of the improvement is going to come from open AI being deployed in our cloud and it's in its back end loaded as I mentioned in my remarks at the beginning we actually have in the forecast that hardware will will come down a little bit sequentially for the rest of the year so you know I'm I'm being conservative for the second half as we're still pretty early in the year data center capacity is coming on and as we move throughout the year we'll update you as we have more information about

Operator

the progress and timing. Thank you. My next question comes from the line of Matt Bryson of Wedbush Securities. Your line is open, Matt. Hey, thanks for taking my question.

Matt Bryson, Analyst — Wedbush Securities

Just going back to trying to figure out the market, it sounds like there's some more opportunity for what we're seeing with Amazon where they're using Cerebris solutions as decode. When we're thinking about the amount of value that you're capturing um in that type of architecture versus uh pre-fetch is there any chance you could take a swag at kind of what portion the values in the server system

Andrew Feldman, CEO

um not not exactly let let me share maybe a a different uh crack at the problem uh a decode pre-fill a disaggregated solution is really good in some instances and in particular if you know the shape of the work it's intended to support when you when you specialize right when you buy some hardware for for pre-fill and some for decode you embed in your hardware deployment an assumption about the shape of the traffic and if the traffic looks different then you have stranded compute and low utilization and higher cost this is obviously a huge opportunity for a hyperscaler like the AWS because they have technology that can drive traffic right of the shape they expected to their disaggregated solution and route it to other solutions if it's different from that assumption right so that the value of the solution is highest so hyperscale the exact split of value between us and tranium is very difficult to say and as nobody has yet has deployed a true disaggregated solution we

Matt Bryson, Analyst — Wedbush Securities

have a lot to learn in the market still. Understood. That's helpful. And then just one for you, Bob. When we're thinking about you renting out capacity from a customer to fill that open AI demand, is the full rental requirement baked into your quarterly guide? or is there any chance that there's a further impact on gross margins in Q3? Basically, I'm trying to figure out if gross margins in Q2 are trough.

Bob Komen, CFO

So the rental costs that we're assuming for the rest of the year are baked into Q2 and the annual guide.

Bob Komen, CFO

Awesome. Thank you.

Operator

Thank you. Our next question comes from the line of DJ Rakesh from Missoula. Your line is over to DJ. Hi, thanks, Andrew and Bob. Congratulations on a good quarter and good year.

DJ Rakesh, Analyst — Mizuho

Just wondering, you mentioned 50 megawatts per month RAM into 4G26. I'm just wondering how that is going, and how do you see that scaling into 2027?

Bob Komen, CFO

And a quick follow-up.

Andrew Feldman, CEO

I don't think I mentioned that. uh i'm maybe i didn't hear the question right can you repeat the question uh i think you

DJ Rakesh, Analyst — Mizuho

have talked about uh 50 i believe you are talking about a 50 megawatt per month ramp into 4q26 and then just wondering how that is going and how you see that beyond uh how that

Andrew Feldman, CEO

capacity ramping into 27 yeah okay I I don't remember giving specifics on the monthly ramp we are seeking on average a huge amount of capacity in through the end of 26 and into 27 as you know we signed our agreement with open AI at the end of 25 which means you probably need six or eight or ten months at a minimum to bring on vastly more capacity and as our business ramps we are signing large deals as well many of which will come on in the first part of 2027 I think we announced a 120 megawatt deal with Bell Canada for example in a facility there that does have room to expand. So I think while we haven't given specifics, we are working our hardest to add as much capacity as we can between now and the

DJ Rakesh, Analyst — Mizuho

end of 27. And then obviously mentioned fast inference is very disruptive. You we probably see a lot of LLM front there model guys try to move to faster and faster I'm just wondering on how you see your customer pipeline broadening out into 27 if you were to you know

Andrew Feldman, CEO

look out beyond open AI and AWS thanks sure look we were pleased with with the way the the customer pipeline is going um i think obviously deals of the size of of open ai or the size that aws could do are are few and far between but the business is robust and we're we're happy at the rate at which we're signing new customers we're also happy at the rate at which existing customers are doubling down growing their footprints and the rate at which sort of their token consumption is up into the right and so on on all fronts we're pretty pleased thank you

Operator

our next question comes from the line of richard shannon of craig holland capital group your line

Richard Shannon, Analyst — Craig-Hallum Capital Group

is open richard well thanks andrew bob for uh letting me ask a couple questions congrats on the uh uh first quarter call here um andrew my first question is following up on one of your a couple of different comparative marks regarding OpenAI. You talked about stepping up a new model under 35 days here. Then you also mentioned about doing some work with GPT 5.4. I'd love to hear about your experience in bringing up the first model, the Codex Spark, and what you've learned from that and how you apply that to working with the GPT 5.4. I think you might be going forward

Andrew Feldman, CEO

with open air and or other customers you know I I think with I think foundation model providers are fundamentally different they are at the absolute cutting edge what you see when you engage with them is really quite extraordinary and the amount of work that goes into a foundation model and and the visibility that we have is really one of the exceptional advantages that we get from this partnership. So I think beginning with Spark, we got better. I think it improved us. It challenged us. We were up to the task. We very much enjoy working with their engineering team. And I think from the feedback we've gotten, they found kindred spirits and enjoy working with our team as well. And so I think the way to temper metal is with FIRE. And I think we're proud of our work with them and our continued work. And so I think it's a really thoughtful question. I think having access to extraordinary customers and partners is a fundamental and long-term differentiator.

Richard Shannon, Analyst — Craig-Hallum Capital Group

Thanks for that, Andrew. my follow-on questions regarding AWS. There are media reports out there that Amazon may be trying to sell the Tradium-based hardware externally and not just in their own data centers. Do you view this as an opportunity for Cerberus? I do. Okay, great. Thank you, Andrew.

Andrew Feldman, CEO

Thank you. And with that, I think we'll wrap up. Yes, sir. We have reached the end of the Q&A

Operator

session and that does conclude today's conference call. Thank you for participating. You may now I'll disconnect.