Welcome!

Linux Containers Authors: Elizabeth White, Anders Wallgren, Derek Weeks, Tim Hinds, Robert Reeves

Related Topics: Linux Containers

Linux Containers: Article

VM: Virtual Memory or Virtual Mayhem?

How to solve the virtual memory problem in all Linux kernels.

(LinuxWorld) -- The fur has been flying in the Linux kernel development community of late, particularly because there's a lot of contention about how Linux should do virtual memory management (VM). Andrea Arcangeli and Rik van Riel have the two most visible opinions on how the VM should work, so they're often seen as at the heart of the struggle. However, it goes much deeper than that. The VM even seems like a hot button between Linus Torvalds and Alan Cox, although I'm guessing they're not as passionate about the issue as the journalists would like them to be to get mileage out of the topic.

Take us, for example. None other than our own Joshua Drake called the 2.4 Kernel the Kernel of pain, primarily because of problems with the VM algorithms and changes. Yours truly complained a little about the VM last August.

I'm about to weigh in once again about how I feel about VM, only in more detail this time. We're definitely into milking this topic for all it's worth.

After looking at the various VM algorithms in play, I've come up with an alternative. I'm sure Andrea Arcangeli and Rik van Riel are flooded with the same old suggestions over and over again, and it's entirely possible they've heard this one before. Nevertheless, I'll propose it, anyway.

First, let me walk you through a little of the history that gave me the notion of how best to deal with the Linux VM issue.

The brain-dead system that wouldn't die

Back when I was a programmer by trade, one of the projects I worked on involved Fortran programming for a farm of PDP-11s. I honestly don't remember which OS we had installed on the PDPs, but if I had to guess I'd say it was probably RSX-11/M.

What I'll never forget, however, is the one quirk I hated beyond all others. You couldn't run executable files unless they were stored in contiguous blocks on the disk.

Programmers are a virtual disk frag factory, so we ran out of contiguous space almost daily. This means we often found ourselves in the uncomfortable situation where we could no longer test the modifications to our programs until someone defragged the drives. The worst of it was that, if there was such a thing as a disk defrag utility available, the systems guys obviously didn't know about it. They solved the problem by taking the systems off line, backing up, and then restoring the contents of the drives. We couldn't even edit code while we waited.

By the way, Microsoft credits Windows NT architect Dave Cutler as the designer of RSX-11/M (see resources). As tempting as it might be to compare the brain-dead pieces of Windows NT to the bizarre behavior of RSX-11/M, it would be unfair to do so for two reasons. First, more reliable sources than Microsoft only credit Cutler for the design of VMS. Cutler borrowed some RSX-11/M code for VMS, but didn't design RSX-11/M. Second, RSX-11/M was probably only brain-dead because the PDP-11 wasn't much of a brain.

Any computer historian knows the PDP-11 was a breakthrough in affordable computing in its time. Compared to today's desktops, it would be like taking out a second mortgage to buy an abacus. A PDP-11 computer with 8K words of core memory and a 256K disk cost about $30,000 in 1972. It could run you thousands of dollars for an additional 4K of memory.

It may be possible that the RSX-11/M architect was an idiot. It is also possible that the resident portion of the RSX-11/M kernel would have to exceed the size of the typical memory configuration for the PDP-11 in order to change this behavior, which would inflate the price of the system in the process.

I don't happen to recall how much memory we had on our PDP-11s or how much of it was used by RSX-11/M. However, I do know we never had enough. I had to implement overlays to make my Fortran programs work. In case you're not an old fart as I am, overlays are a bit like implementing virtual memory at the application level instead of the OS. You split off parts of your program into modules called overlays. The main program loads an overlay whenever it needs the functions in that overlay.

The trick is to make sure that none of your overlays rely on functions that reside in any other overlays, because your goal is to have only one overlay in memory at a time. I'm not even sure it was possible to have more than one loaded, but it's been too long to remember. Regardless, I'd bet the company paid at least double my yearly salary for some of those PDP-11s, so it was worth the relatively minimal effort for me to break up my program into overlays.

The new VM proposal

What does this have to do with virtual memory performance on the Linux 2.4 kernel?

Everything.

First, think about what the VM is for. The VM is like an extremely sophisticated automated overlay system, only it deals with many more types of memory storage. It is the part of the OS that comes to the rescue when you have used up all your expensive memory and need more.

Here's how it works. The OS finds some data in memory that can safely be removed and stores it to the cheaper storage (disk swap space), which frees up some expensive memory for other use. When a program needs the data that has been swapped to disk, the OS swaps something else out to disk and brings the needed data back into memory.

Most of the arguments about the VM in Linux revolve around how the OS should decide which memory is swapped and when, and methods to make the process fast and painless.

The remaining arguments are usually about what the OS needs to do when you've filled up all available memory and swap space but some task still needs even more memory. In this case, most people agree that the OS needs to kill one or more running tasks to free up memory. Since you're talking about stopping programs dead in their tracks, you have to address the issue of how the OS decides which tasks are less important than others and can afford to be killed.

If you don't see how this relates to my PDP story yet, then here's a hint. The controversy over the VM involves several extremely talented programmers, each of whom could command hundreds of thousands of dollars per year in salaries. They have been spending a great deal of their time and brain power over the past years figuring out how to squeeze the best performance out of systems with limited RAM and drive space.

Still don't get it? Then let me get right to my proposal for a new VM algorithm for Linux. Granted, this VM algorithm is not meant for typical system loads, but it would solve the most annoying VM problems.

I propose that we create a kernel daemon that checks for either of the following two conditions:

  1. The system swaps to disk so much that you see a severe degradation in performance.
  2. A task needs memory after all available RAM and swap is filled.

If either condition is met, the kernel then kills all tasks except those it needs to display the following message on the screen: "Lay off the doughnuts this week and spend the money to buy another DIMM, you penny pinching skinflint!"

As a bonus, the daemon could check the Internet for current pricing and replace the part about "doughnuts" with some comparison that better represents the current state of the market.

Again, I apologize to Andrea and Rik if this has already been suggested, and I suspect it has. Nevertheless, it was therapeutic, if not useful, to offer the advice.

More Stories By Nicholas Petreley

Nicholas Petreley is a computer consultant and author in Asheville, NC.

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


@ThingsExpo Stories
SYS-CON Events announced today that DatacenterDynamics has been named “Media Sponsor” of SYS-CON's 18th International Cloud Expo, which will take place on June 7–9, 2016, at the Javits Center in New York City, NY. DatacenterDynamics is a brand of DCD Group, a global B2B media and publishing company that develops products to help senior professionals in the world's most ICT dependent organizations make risk-based infrastructure and capacity decisions.
So, you bought into the current machine learning craze and went on to collect millions/billions of records from this promising new data source. Now, what do you do with them? Too often, the abundance of data quickly turns into an abundance of problems. How do you extract that "magic essence" from your data without falling into the common pitfalls? In her session at @ThingsExpo, Natalia Ponomareva, Software Engineer at Google, will provide tips on how to be successful in large scale machine lear...
The IoT has the potential to create a renaissance of manufacturing in the US and elsewhere. In his session at 18th Cloud Expo, Florent Solt, CTO and chief architect of Netvibes, will discuss how the expected exponential increase in the amount of data that will be processed, transported, stored, and accessed means there will be a huge demand for smart technologies to deliver it. Florent Solt is the CTO and chief architect of Netvibes. Prior to joining Netvibes in 2007, he co-founded Rift Technol...
The increasing popularity of the Internet of Things necessitates that our physical and cognitive relationship with wearable technology will change rapidly in the near future. This advent means logging has become a thing of the past. Before, it was on us to track our own data, but now that data is automatically available. What does this mean for mHealth and the "connected" body? In her session at @ThingsExpo, Lisa Calkins, CEO and co-founder of Amadeus Consulting, will discuss the impact of wea...
SYS-CON Events announced today that Peak 10, Inc., a national IT infrastructure and cloud services provider, will exhibit at SYS-CON's 18th International Cloud Expo®, which will take place on June 7-9, 2016, at the Javits Center in New York City, NY. Peak 10 provides reliable, tailored data center and network services, cloud and managed services. Its solutions are designed to scale and adapt to customers’ changing business needs, enabling them to lower costs, improve performance and focus inter...
We’ve worked with dozens of early adopters across numerous industries and will debunk common misperceptions, which starts with understanding that many of the connected products we’ll use over the next 5 years are already products, they’re just not yet connected. With an IoT product, time-in-market provides much more essential feedback than ever before. Innovation comes from what you do with the data that the connected product provides in order to enhance the customer experience and optimize busi...
In his session at @ThingsExpo, Chris Klein, CEO and Co-founder of Rachio, will discuss next generation communities that are using IoT to create more sustainable, intelligent communities. One example is Sterling Ranch, a 10,000 home development that – with the help of Siemens – will integrate IoT technology into the community to provide residents with energy and water savings as well as intelligent security. Everything from stop lights to sprinkler systems to building infrastructures will run ef...
Digital payments using wearable devices such as smart watches, fitness trackers, and payment wristbands are an increasing area of focus for industry participants, and consumer acceptance from early trials and deployments has encouraged some of the biggest names in technology and banking to continue their push to drive growth in this nascent market. Wearable payment systems may utilize near field communication (NFC), radio frequency identification (RFID), or quick response (QR) codes and barcodes...
SYS-CON Events announced today that Stratoscale, the software company developing the next generation data center operating system, will exhibit at SYS-CON's 18th International Cloud Expo®, which will take place on June 7-9, 2016, at the Javits Center in New York City, NY. Stratoscale is revolutionizing the data center with a zero-to-cloud-in-minutes solution. With Stratoscale’s hardware-agnostic, Software Defined Data Center (SDDC) solution to store everything, run anything and scale everywhere...
Angular 2 is a complete re-write of the popular framework AngularJS. Programming in Angular 2 is greatly simplified – now it's a component-based well-performing framework. This immersive one-day workshop at 18th Cloud Expo, led by Yakov Fain, a Java Champion and a co-founder of the IT consultancy Farata Systems and the product company SuranceBay, will provide you with everything you wanted to know about Angular 2.
You think you know what’s in your data. But do you? Most organizations are now aware of the business intelligence represented by their data. Data science stands to take this to a level you never thought of – literally. The techniques of data science, when used with the capabilities of Big Data technologies, can make connections you had not yet imagined, helping you discover new insights and ask new questions of your data. In his session at @ThingsExpo, Sarbjit Sarkaria, data science team lead ...
SYS-CON Events announced today that Men & Mice, the leading global provider of DNS, DHCP and IP address management overlay solutions, will exhibit at SYS-CON's 18th International Cloud Expo®, which will take place on June 7-9, 2016, at the Javits Center in New York City, NY. The Men & Mice Suite overlay solution is already known for its powerful application in heterogeneous operating environments, enabling enterprises to scale without fuss. Building on a solid range of diverse platform support,...
You deployed your app with the Bluemix PaaS and it's gaining some serious traction, so it's time to make some tweaks. Did you design your application in a way that it can scale in the cloud? Were you even thinking about the cloud when you built the app? If not, chances are your app is going to break. Check out this webcast to learn various techniques for designing applications that will scale successfully in Bluemix, for the confidence you need to take your apps to the next level and beyond.
Whether your IoT service is connecting cars, homes, appliances, wearable, cameras or other devices, one question hangs in the balance – how do you actually make money from this service? The ability to turn your IoT service into profit requires the ability to create a monetization strategy that is flexible, scalable and working for you in real-time. It must be a transparent, smoothly implemented strategy that all stakeholders – from customers to the board – will be able to understand and comprehe...
Increasing IoT connectivity is forcing enterprises to find elegant solutions to organize and visualize all incoming data from these connected devices with re-configurable dashboard widgets to effectively allow rapid decision-making for everything from immediate actions in tactical situations to strategic analysis and reporting. In his session at 18th Cloud Expo, Shikhir Singh, Senior Developer Relations Manager at Sencha, will discuss how to create HTML5 dashboards that interact with IoT devic...
Artificial Intelligence has the potential to massively disrupt IoT. In his session at 18th Cloud Expo, AJ Abdallat, CEO of Beyond AI, will discuss what the five main drivers are in Artificial Intelligence that could shape the future of the Internet of Things. AJ Abdallat is CEO of Beyond AI. He has over 20 years of management experience in the fields of artificial intelligence, sensors, instruments, devices and software for telecommunications, life sciences, environmental monitoring, process...
SYS-CON Events announced today that Ericsson has been named “Gold Sponsor” of SYS-CON's @ThingsExpo, which will take place on June 7-9, 2016, at the Javits Center in New York, New York. Ericsson is a world leader in the rapidly changing environment of communications technology – providing equipment, software and services to enable transformation through mobility. Some 40 percent of global mobile traffic runs through networks we have supplied. More than 1 billion subscribers around the world re...
There is an ever-growing explosion of new devices that are connected to the Internet using “cloud” solutions. This rapid growth is creating a massive new demand for efficient access to data. And it’s not just about connecting to that data anymore. This new demand is bringing new issues and challenges and it is important for companies to scale for the coming growth. And with that scaling comes the need for greater security, gathering and data analysis, storage, connectivity and, of course, the...
Machine Learning helps make complex systems more efficient. By applying advanced Machine Learning techniques such as Cognitive Fingerprinting, wind project operators can utilize these tools to learn from collected data, detect regular patterns, and optimize their own operations. In his session at 18th Cloud Expo, Stuart Gillen, Director of Business Development at SparkCognition, will discuss how research has demonstrated the value of Machine Learning in delivering next generation analytics to im...
This is not a small hotel event. It is also not a big vendor party where politicians and entertainers are more important than real content. This is Cloud Expo, the world's longest-running conference and exhibition focused on Cloud Computing and all that it entails. If you want serious presentations and valuable insight about Cloud Computing for three straight days, then register now for Cloud Expo.