On November 21, 2014, at Supercomputing 2014, I participated in a panel on "Beyond Von Neumann, Neuromophic Systems and Architectures" organized by Mark E. Dean along with R. Jacob Vogelstein, Karlheinz Meier, Kris Carlson, and Dhireesha Kudithipudi.
This week, IBM and Cornell team presented a ACM Gordon Bell Prize Finalist paper at Supercomputing 2014:
Title: "Real-time Scalable Cortical Computing at 46 Giga-Synaptic OPS/Watt with 100× Speedup in Time-to-Solution and 100,000× Reduction in Energy-to-Solution".
Authors: Andrew S. Cassidy, Rodrigo Alvarez-Icaza, Filipp Akopyan, Jun Sawada, John V. Arthur, Paul A. Merolla, Pallab Datta, Marc Gonzalez Tallada, Brian Taba, Alexander Andreopoulos, Arnon Amir, Steven K. Esser, Jeff Kusnitz, Rathinakumar Appuswamy, Chuck Haymes, Bernard Brezzo, Roger Moussalli, Ralph Bellofatto, Christian Baks, Michael Mastro, Kai Schleupen, Charles E. Cox, Ken Inoue, Steve Millman, Nabil Imam, Emmett McQuinn, Yutaka T. Nakamura, Ivan Vo, Chen Guo, Don Nguyen, Scott Lekuch, Sameh Assad, Daniel Friedman, Bryan L. Jackson, Myron D. Flickner, William P. Risk, Rajit Manohar, Dharmendra S. Modha
Abstract: Drawing on neuroscience, we have developed a parallel, event-driven kernel for neurosynaptic computation, that is efficient with respect to computation, memory, and communication. Building on the previously demonstrated highly-optimized software expression of the kernel, here, we demonstrate TrueNorth, a co-designed silicon expression of the kernel. TrueNorth achieves five orders of magnitude reduction in energy-to- solution and two orders of magnitude speedup in time-to-solution, when running computer vision applications and complex recurrent neural network simulations. Breaking path with the von Neumann architecture, TrueNorth is a 4,096 core, 1 million neuron, and 256 million synapse brain-inspired neurosynaptic processor, that consumes 65mW of power running at real-time and delivers performance of 46 Giga-Synaptic OPS/Watt. We demonstrate seamless tiling of TrueNorth chips into arrays, forming a foundation for cortex-like scalability. Unprecedented time-to-solution, energy-to-solution, scale, and performance of TrueNorth, combined with underlying flexibility of the kernel enables a broad range of cognitive applications.
The long-term aspiration is to build a "1%-human-scale" system with 4,096 processors one trillion synapses and that consumes merely 4kW.
Illustration Credit: William Risk
On May 10-11, 2006, I chaired IBM's Almaden Institute on Cognitive Computing. The Institute brought together over 165 attendees from over 57 different institutions and featured prominent speakers and panelists: Nobelist Gerald Edelman, The Neurosciences Institute, Henry Markram, EPFL/BlueBrain, Robert Hecht-Nielsen, UCSD, Jeff Hawkins, Palm/Numenta, James Albus, NIST, Theodore Berger, USC, Kwabena Boahen, Stanford, Ralph Linsker, IBM, Jerry Swartz, The Swartz Foundation, V. S. Ramachandran, UCSD, John Searle, UC Berkeley, Joaquin Fuster, UCLA, Leslie Valiant, Harvard University, Toby Berger, University of Virginia, and Christof Koch, Caltech.
Today, 8.5 years later, I chaired IBM's Cognitive Systems Colloquium with the goal of Taking Brain-Inspired Computing to Market. The Colloquium gathered over 200 eminent, innovative thinkers from academia, government, industry, research and the media.
Here are speakers and panelists:
The Colloquium also included a SyNAPSE Deep Dive that covered: TrueNorth Architecture, TrueNorth Neuron Model, TrueNorth Chip with 1 million neurons and 256 million synapses, Single Chip Board, 16 chip Board, Future Scaling Path, TrueNorth "program", Compass Simulator, Corelet Programming Language, SyNAPSE University, and Demos.
Videos from the event will be available in the near future. Here is a summary.
Illustration Credit: Aaron Cox
Six years ago, IBM and our university partners embarked on a quest–to build a brain–inspired machine–that at the time appeared impossible. Today, in an article published in Science, we deliver on the DARPA SyNAPSE metric of a one million neuron brain–inspired processor. The chip consumes merely 70 milliwatts and is capable of 46 billion synaptic operations per second per watt–literally a synaptic supercomputer in your palm. Here are links to IBM Press Release, DARPA Press Release, and Steve Hamm's blog.
Along the way–progressing through Phase 0, Phase 1, Phase 2, and Phase 3–we have journeyed from neuroscience to supercomputing, to a new computer architecture, to a new programming language, to algorithms, applications, and now to a new chip—TrueNorth.
Vertically–Integrated SyNAPSE Ecosystem
(A Clickable Map of Papers)
Illustration Credit: William Risk
Let me take this opportunity to walk you down this road less travelled (rather, previously untraveled!). At such an opportune moment, I hope this reflection and vision will incite within you a burning desire to collaborate and partner with us to make the future journey a joint one.
Today's computers can be traced back at least to Blaise Pascal's 1642 mechanical calculator. The modern era in computing started with the unveiling of ENIAC on February 14, 1946. The development of the transistor in 1948 enabled the creation of integrated circuits in 1958, which, in turn, enabled the first microprocessor in 1971. Since then the clock frequency of the microprocessors has increased 1000–fold. As remarkable as this evolution is, it has been headed in a direction diametrically opposite to the computing paradigm of the brain. Consequently, today's microprocessors are eight orders of magnitude faster (in terms of clock rate) and four orders of magnitude hotter (in terms of power per unit cortical area) than the brain.
Considering overall energy consumption underscores the divergence between the brain and today's computers even more starkly. Note that a “human–scale” simulation with one hundred trillion synapses (with relatively simple models of neurons and synapses) required 96 Blue Gene/Q racks of the Lawrence Livermore National Lab Sequoia supercomputer–and, yet, the simulation ran 1500 times slower than real–time. A hypothetical computer to run this simulation in real–time would require 12GW, whereas the human brain consumes merely 20W.
What explains this disparity? There are two factors: technology and architecture. Unlike today's inorganic silicon technology, the brain uses biophysical, biochemical, organic wetware. While enormous advances in future enabling nanotechnology are underway, we focused on the second factor: architecture innovation–specifically, on minimizing the product of power, area, and delay in a system that could be implemented in today's state–of–the–art technology.
The cerebral cortex is hypothesized to comprise repeating canonical cortical microcircuits. Inspired by this hypothesis, in 2011, we demonstrated an event–driven “worm–scale” neurosynaptic core that integrated computation and memory. Now, we have shrunk the neurosynaptic core by 15–fold in area and 100–fold in power, and have tiled 4,096 cores via an on–chip network to create TrueNorth–with one million neurons and 256 million synapses. It is worth noting that we had only committed to deliver a chip with 1,024 cores, but, in November 2011, as a team, we made a gutsy decision to increase the scale four–fold to 4,096 cores. Fabricated in Samsung's 28nm process, with 5.4 billion transistors, TrueNorth is IBM's largest chip to date in transistor count. While simulating complex recurrent neural networks, TrueNorth consumes < 100mW of power and has a power density of 20mW / cm2.
Unlike the prevailing von Neumann architecture–but like the brain–TrueNorth has a parallel, distributed, modular, scalable, fault–tolerant, flexible architecture that integrates computation, communication, and memory and has no clock. It is fair to say that TrueNorth completely redefines what is now possible in the field of brain–inspired computers, in terms of size, architecture, efficiency, scalability, and chip design techniques.
Designing and testing TrueNorth was no cakewalk. Its unprecedented size, unconventional architecture, new hybrid synchronous–asynchronous circuit methodology, and a new unfamiliar technology process required custom design, verification, and testing methodologies that demanded innovation, team work, and project management at the highest level. A critical element was one–to–one equivalence–at the functional level of spikes–between TrueNorth and our software simulator Compass. This equivalence allowed us to begin developing applications long before chips returned from the foundry and to verify correctness of the chip logic.
Having exhausted all available means and tools for verifying the chip before fabrication, to ensure no stone was left unturned I even offered a $1,000 bottle of champagne to anyone who could find a bug. None was found. It was not until a year later–after the chip passed all unit, regression, functional, and multi–chip communication tests–that we were certain no fatal bugs existed. My champagne was safe!
The project simply could not have succeeded without the innovative spirit and the tremendous dedication of the current core team. Cornell Tech provided the critically important asynchronous circuit design tools that we jointly refined over the course of the project. Collaboration with Samsung was critical in gaining access to their advanced 28nm foundry process that allowed balancing the low active power of the architecture with matching low power of the underlying silicon technology. I am immensely grateful to our 200+ collaborators since 2008–spanning eight IBM labs and fabs, five universities, one start–up, and two Department of Energy laboratories. Finally, DARPA's mandate, metrics, and investment were absolutely vital.
Standing, Left to right: Myron Flickner, Tobi Delbruck, Jun Sawada, Bryan Jackson, Wendy Belluomini, Tim Melano, Marc Gonzalez–Tallada, Bill Risk, Kumar Appuswamy, Rodrigo Alvarez–Icaza, Brian Taba, Ben Shaw, Sue Gilson, Arvind Kumar, Norm Pass, Luca Longinotti, Arnon Amir, John Best, Scott Lekuch, Rajit Manohar, Steve Esser, John Arthur
Sitting, Right to Left: Filipp Akopyan, Arash Shokouhbakhsh, Sim Bamford, Paul Merola, David Barch, Dharmendra S. Modha, David Berg, Jeff Kusnitz, Alexander Andreoupoulous, Andrew Cassidy
Photo Credit: Hita Bambhania-Modha, photo free to use without restrictions!
Link to high-resolution image is here.
Let's be clear: we have not built the brain, or any brain here. We have built a computer that is inspired by the brain. The inputs to and outputs of this computer are spikes. Functionally, it transforms a spatio–temporal stream of input spikes into a spatio–temporal stream of output spikes.
If one were to measure activities of one million neurons in TrueNorth, one would see something akin to a night cityscape with blinking lights. Given this unconventional computing paradigm, compiling C++ to TrueNorth is like using a hammer for a screw. As a result, to harness TrueNorth, we have designed an end–to–end ecosystem complete with a new simulator, a new programming language, an integrated programming environment, new libraries, new (and old) algorithms as well as applications, and a new teaching curriculum (affectionately called, “SyNAPSE University”). The goal of the ecosystem is to dramatically increase programmer productivity. Metaphorically, if TrueNorth is ENIAC, then our ecosystem is the corresponding FORTRAN.
We are working, at a feverish pace, to make the ecosystem available–as widely as possible–to IBMers, universities, business partners, start–ups, and customers. In collaboration with the international academic community, by leveraging the ecosystem, we foresee being able to map the existing body of neural network algorithms to the architecture in an efficient manner, as well as being able to imagine and invent entirely new algorithms.
To support these algorithms at ever increasing scale, TrueNorth chips can be seamlessly tiled to create vast, scalable neuromorphic systems. In fact, we have already built systems with 16 million neurons and 4 billion synapses. Our sights are now set high on the ambitious goal of integrating 4,096 chips in a single rack with 4 billion neurons and 1 trillion synapses while consuming ~4kW of power.
The architecture can solve a wide class of problems from vision, audition, and multi–sensory fusion, and has the potential to revolutionize the computer industry by integrating brain–like capability into devices where computation is constrained by power and speed. These systems can efficiently process high–dimensional, noisy sensory data in real time, while consuming orders of magnitude less power than conventional computer architectures.
On one hand–with portable devices–think smart phones, sensor networks, self–driving automobiles, robots, public safety, medical imaging, real–time video analysis, signal processing, olfactory detection, and digital pathology. On the other hand–with synaptic supercomputers–think multimedia processing on the cloud. In addition, our chip can be used in combination with other cognitive computing technologies to create systems that learn, reason and help humans make better decisions. Over time, our hope is that SyNAPSE will become an integral component of IBM WATSON group offerings.
We have been working with iniLabs Ltd., creators of a retinal camera–the DVS–that directly produces spikes, which are the natural inputs for TrueNorth. Integrating the two, we have begun investigating extremely low–power end–to–end vision systems.
If we think of today's von Neumann computers as akin to the “left–brain”–fast, symbolic, number–crunching calculators, then TrueNorth can be likened to the “right–brain”–slow, sensory, pattern recognizing machines.
We envision augmenting our neurosynaptic cores with synaptic plasticity to create a new generation of field–adaptable neurosynaptic computers capable of online learning.
I was not there when ENIAC was unveiled, but I have a palpable sense that we are at a similar turning point in the history of computing. The technological and practical possibilities are immense and could touch every sphere of science, technology, business, government, and society. I am optimistic that the enduring value of our work will be the inspiration of a completely different way of thinking about computing. It will, I believe, spawn an outpouring of creativity by universities, startups, established tech companies, and by professionals in countless industries and occupations.
We are not there yet. Indeed, TrueNorth is a direction and not a destination! The end goal is building intelligent business machines that enable a cognitive planet, while transforming industries. Exciting!
The postings on this site are my own and don’t necessarily represent IBM’s positions, strategies or opinions.