結局のところ、私は「AI」から逃げたつもりで、実は「重い計算と向き合う現実」からも逃げていたのだと思います(MASは例外)。

ebata

1か月前

私は、大学生（社会人になる前）のころ、ファジィ推論やニューラルネットワークの研究を行い、這々(ほうほう)の体で修士論文を放り込み、そのまま逃げ出してきたエンジニアです。
When I was a university student (before entering the workforce), I worked on research in fuzzy inference and neural networks, barely managed to submit my master’s thesis, and then fled from the field as an engineer.

AIの冬の時代における、元AI研究者に対する世間の冷たさを身をもって味わってきた私は、「もう二度と“AI”と名前を冠する研究には関わるまい」と決め、今まで逃げ続けてきました。
Having personally experienced the cold attitude society held toward former AI researchers during the so-called AI winter, I decided that I would never again be involved in research bearing the name “AI,” and I kept running away from it.

ところが――どうやら世界は、私にそれを許してくれなかったようです。
However, it seems the world was not willing to grant me that escape.

会社の期首朝礼で、
At the company’s beginning-of-term assembly,

―― 社員の8割を、AIの研究者（技術者）にする
— “We will turn 80% of our employees into AI researchers (engineers).”

という宣言がなされた瞬間、目の前が真っ暗になったのを覚えています。
I remember that the moment this declaration was made, everything went dark before my eyes.

「やっぱり逃げられないのかぁ」と、自分の運命を呪いました。私の予想より5年ほど、AIの拡散が早かった、という感覚です。
“Guess I really can’t escape after all,” I thought, cursing my fate. It felt as though the spread of AI had arrived about five years earlier than I had expected.

あらかじめ「逃げられない」と分かっていれば、その前に打てた手もあったはずなのに――と、溜息をついています。
If I had known in advance that escape was impossible, there were steps I could have taken beforehand—I find myself sighing at that thought.

---

打てた手とは何か。具体的には、次のようなものです。
So what were those steps? More specifically, things like the following.

(1) GPUアーキテクチャとCUDA周りの基礎知識
(1) Fundamental knowledge of GPU architecture and the CUDA ecosystem

CUDAコア／SM（Streaming Multiprocessor）構成、メモリ階層（global / shared / constant）、PCIe転送、VRAM制約、NVIDIAドライバとCUDA Toolkitの関係性など。「GPU＝速い箱」ではなく、「制約だらけの計算機」として理解しておくべきでした。
CUDA cores and SM (Streaming Multiprocessor) structure, memory hierarchies (global / shared / constant), PCIe transfers, VRAM limitations, and the relationship between NVIDIA drivers and the CUDA Toolkit—these should have been understood not as “a GPU is just a fast box,” but as “a computer full of constraints.”

(2) 深層学習フレームワークの“中身”寄りの理解
(2) A more internal understanding of deep learning frameworks

PyTorch や TensorFlow を「使う」だけでなく、計算グラフ、autograd、テンソル配置（CPU/GPU）、dtype（fp32 / fp16 / bf16）の意味。LLM以前の段階で、ここで一度地獄を徘徊しておくべきでした。
Rather than merely “using” PyTorch or TensorFlow, I should have grappled with computation graphs, autograd, tensor placement (CPU/GPU), and the meaning of dtypes (fp32 / fp16 / bf16). Before the era of LLMs, I should have wandered through this hell at least once.

(3) 分散学習・並列化の基本パターン
(3) Basic patterns of distributed learning and parallelization

Data Parallel / Model Parallel / Pipeline Parallel、NCCL、AllReduce、ノード間通信のボトルネック。これを経験していれば、AIの本質が「モデル」ではなく「通信」になる瞬間が来ることを、あらかじめ予見できたはずです。
Data Parallel, Model Parallel, Pipeline Parallel, NCCL, AllReduce, and inter-node communication bottlenecks—had I experienced these, I should have been able to foresee the moment when the essence of AI shifts from “the model” to “communication.”

(4) LLM以前の「巨大モデル運用」技術
(4) “Large model operations” techniques before LLMs

バッチサイズ調整、勾配累積、チェックポイント、OOM回避、推論時のKVキャッシュ。LLMは突然現れたのではなく、「重すぎるモデル運用」の延長線上にあった、という現実を直視できたはずです。
Batch size tuning, gradient accumulation, checkpointing, avoiding OOM errors, KV cache usage during inference—LLMs did not appear out of nowhere; I should have been able to face the reality that they lie on the extension of “operating models that are simply too heavy.”

(5) AIプラットフォームの設計思想
(5) Design philosophy of AI platforms

Kubernetes、GPUスケジューリング、MLOps、モデル配布、推論API化、課金単位。アルゴリズムではなく、「運用前提の設計」が主戦場になると分かっていれば、この点での無知は致命傷にならなかったでしょう。
Kubernetes, GPU scheduling, MLOps, model distribution, inference APIs, and billing units—if I had understood that the primary battlefield would be “design with operations in mind,” rather than algorithms themselves, my ignorance here would not have been fatal.

これら5つに共通しているのは、「AIの理論」ではなく、「AIを動かすための地獄」に関する知識である、という点です。
What these five points have in common is that they concern not “AI theory,” but the “hell required actually to run AI.”

今、私が苦しんでいるのは、LLMが難しいからではありません。LLM以前に存在していた、“計算機・運用・並列化の地獄”を、一気に踏まされているからです。
What I am struggling with now is not that LLMs are difficult, but that I am being forced to wade through the pre-existing hell of computation, operations, and parallelization all at once.

---

振り返ってみれば、「AI研究を避けてきたこと」自体は、必ずしも間違いではなかったと思っています。
Looking back, I don’t think that avoiding AI research itself was necessarily a mistake.

少なくとも当時の私にとっては、合理的な判断でした。
At least for who I was at the time, it was a rational decision.

ブームと失望を繰り返す分野に正面から居続ける体力も覚悟もなかったですし、「AI」という看板を掲げた瞬間に背負わされる過剰な期待値から距離を取る、という意味では、逃げは成功していたと思うのです。
I lacked both the stamina and resolve to stay at the forefront of a field that repeatedly cycles between hype and disappointment, and in terms of distancing myself from the excessive expectations imposed the moment one raises the “AI” banner, my escape was a success.

ただ一方で、明確に戦略ミスだったと思うのは、「計算機資源側」まで一緒に投げ捨ててしまったことです。
On the other hand, what I clearly consider a strategic mistake was throwing away the “computational resources side” along with AI itself.

GPU、並列計算、分散処理、巨大メモリ空間、そして帯域制約。
GPUs, parallel computation, distributed processing, massive memory spaces, and bandwidth constraints.

それらは本来、AI固有の問題ではなく、計算機科学の延長線上にあるテーマでした。
These were never problems unique to AI, but themes lying squarely in the extension of computer science.

にもかかわらず私は、それらを「AIの人たちがやる話」と無意識に線引きし、触れない理由を自分の中で正当化してきたのだと思います。（マルチエージェントシミュレーション(MAS)についての計算機資源はきっちり調査してきたにも関わらず、AIについては意図的に「見ないフリ」をしてきたような気がします）
Nevertheless, I think I unconsciously drew a line, labeling these as “things AI people deal with,” and justified to myself why I shouldn’t touch them. (Despite thoroughly investigating computational resources for multi-agent simulations (MAS), I feel I deliberately pretended not to see them when it came to AI.)

結果として今、私は「AIを学び直している」というよりも、“計算機屋としての再教育”を、強制的に受けている感覚に近い状態にあります。
As a result, I now feel less like I am “relearning AI” and more like I am being forced to undergo “retraining as a computer engineer.”

LLMがどうこう以前に、GPUがどういう制約で動き、どこでメモリが詰まり、どこで通信が律速になり、なぜスケールしないのか――
Before any talk of LLMs, I am confronting how GPUs operate under constraints, where memory gets clogged, where communication becomes the bottleneck, and why things fail to scale—

そういう、極めて泥臭い話を、一気にまとめて踏まされているのです。
All of those extremely gritty issues are being forced upon me at once.

もし当時、AIそのものからは距離を置いたとしても、「計算機資源」「高速化」「並列化」「重いものをどう動かすか」という軸だけは掴みに行っていれば、今の地獄は、もう少し浅瀬で済んだのではないか。そんな後悔は、正直あります。
Even if I had kept my distance from AI itself back then, had I at least pursued the axes of “computational resources,” “acceleration,” “parallelization,” and “how to run heavy things,” this current hell might have been much shallower—that regret is honest and real.

---

もう一つ、最近になって強く思うのは、もし私が「ゲーム」、とりわけeスポーツの世界に対して、もっと深い理解と実践を持っていれば、随分と別の窓口が開いていたのではないか、という点です。
Another thought that has been growing stronger lately is that if I had possessed a deeper understanding and practice in “games,” especially the world of e-sports, a very different gateway might have opened for me.

ゲーム、とりわけeスポーツは、GPU、低レイテンシ、フレームレート、入力遅延、並列処理、リアルタイム性の塊です。
Games, especially e-sports, are a concentration of GPUs, low latency, frame rates, input lag, parallel processing, and real-time performance.

しかもそれらを、理論ではなく「体感」として叩き込まれる世界でもあります。
Moreover, it is a world where these are hammered in not as theory, but as lived, physical experience.

AIやLLMが突然「重い」「遅い」「金がかかる」と騒がれ始めたとき、ゲームの世界にいた人たちは、わりと冷静です。「まあ、そうなるよね」と。彼らはずっと、計算機資源と殴り合ってきたからです。
When AI and LLMs suddenly began to be criticized as “heavy,” “slow,” and “expensive,” people from the gaming world were relatively calm: “Well, of course.” They had been fighting with computational resources all along.

もし私が、GPUを“研究用の装置”ではなく、フレーム落ちする敵、遅延する敵として日常的に相手にしていれば、AIプラットフォームやLLMの世界にも、もう少し自然に入っていけたのではないか。そんなことを考えるようになりました。
If I had dealt with GPUs not as “research equipment” but as everyday enemies that cause frame drops and latency, perhaps I could have entered the world of AI platforms and LLMs more naturally—that is what I have come to think.

---

結局のところ、私は「AI」から逃げたつもりで、実は「重い計算と向き合う現実」からも逃げていたのだと思います(MASは例外)。
In the end, I think that while I believed I was running away from “AI,” I was actually running away from the reality of confronting heavy computation.

そして今、その現実が、名前を変えて追いかけてきた。
And now, that reality has come chasing after me under a different name.

世界は、私に「AI研究者」になることを強制しているわけではないのでしょう。
The world is probably not forcing me to become an “AI researcher.”

ただ、「計算機と本気で向き合え」と言っているだけなのかもしれません。
It may simply be telling me to “face computers seriously.”

そう考えると、これは運命というより、単なる「宿題の提出遅れ」なのだろうな、とも思うのです。
Seen that way, this feels less like fate and more like nothing more than “being late in turning in an assignment.”

Over the AI ――AIの向こう側に