Welcome!

Linux Containers Authors: Liz McMillan, Vaibhaw Pandey, Elizabeth White, Pat Romanski, Yeshim Deniz

Related Topics: Linux Containers

Linux Containers: Article

Open Source "Spying" On Open Source: The CIA Project

LinuxWorld Exclusive Interview With Micah Dowty, Founder and Principal Contributor

LinuxWorld.com recently had the pleasure to interview Micah Dowty, founder and principal contributor to a rather unique project aptly named CIA (http://cia.navi.cx/).  CIA is a project that monitors a wide range of open source projects in real time tracking changes, building statistics, and alerting through a number of channels on events.
 
[LW] Tell us a brief history of yourself and how you came up with the idea for CIA.  What were you trying to solve?
 
[MT] CIA is really the survivor in a chain of failed projects. It started with the Kiwi, my attempt at building a very inexpensive and completely open PDA device. That project failed at its original goals, but I did end up with a "from scratch" Motorola 68k board that booted Linux, which taught me a lot about embedded systems.
 
During the Kiwi's development, I decided I needed to write a GUI. Honestly, this probably originated as Not Invented Here syndrome, but its architecture evolved into something really interesting to me: the PicoGUI project.
 
There might still be people using PicoGUI today, but I lost interest in it a couple years ago. Luckily for CIA, at some point we decided PicoGUI needed a bot reporting commits to our IRC channel.
 
This early bot was the first incarnation of CIA. It was a quick afternoon hack, written by myself and named by Lalo Martins. About a week later, Mike Hearn suggested modifying it to work with any number of projects, and putting it in a central IRC channel. This was June 1, 2003, the birth of #commits on Freenode. In just the space of a few hours, #commits grew from nothing to about the size it is today.
 
Originally, CIA was just created to make the PicoGUI project easier and more fun to keep track of. When we set up #commits, the motivation was mostly just for the novelty of seeing what everyone else is working on at any given time. I think it was much later that we realized just how useful CIA could be to the projects using it. This is one of the things that sparked the complete rewrite in December 2003.
 
[LW] CIA is open-source looking/spying on open-source.  How has your project been received by many of the larger open-source bodies?
 
[MT] The response to CIA has been very positive. Really the only negative comments I remember getting are related to server downtime or bugs in the IRC code. CIA has been pretty reliable on its current home, but there have been periods of time in the past when for either software or hardware reasons it was crashing all the time. It seemed like every time it went down for a few hours I'd get someone threatening to reimplement CIA as a 50 line shell script. Of course, that's pretty much what CIA was before its rewrite- and there are a lot of advantages that this 15,000 lines of Python have over the old pile of shell scripts.
 
There are several large projects that are making use of CIA and showing their support by linking to the web interface. Gaim, AnhkSVN, Enlightenment, Gentoo, Adium, and Beagle are just a few of the larger projects that use CIA and link to it prominently on their web sites.  I don't think CIA has received any official endorsements by large open-source projects or organizations, but some powerful members of these organizations have shown interest. Nat Friedman of Gnome fame was quite excited about CIA and sent a big donation.
 
[LW] What do you see as the top 3 features of CIA?
 
[MT] I think CIA's top feature is that anyone can use it, and it's about as easy to set up as possible for the version control system you're using. With about the same effort it would take to set up a commits mailing list, you can connect your project to a server that will get your commits onto IRC, the web, and RSS.
 
The next best thing about CIA is how it isn't tied to any particular version control system. Internally, CIA is just an architecture for publishing, filtering, and formatting arbitrary messages. CIA supports version control systems I've never used, and it's being used for more esoteric purposes like reporting automated build results. I know I've seen several projects out there for mailing commit messages or generating RSS feeds, but they're all designed for one specific version control system. CIA's client scripts act as an abstraction layer, so by writing a new client you can use it with pretty much anything.
 
The web interface has always been secondary to IRC commit delivery, but I see its ability to create a community of projects as the next most important feature of CIA. Every person and every project on CIA automatically gets a web page, and they're all linked together. Each page has a "related" box that lets you see who works on a particular project, what projects a particular author works on, which version control systems an author uses regularly, etc. These associations actually form an undirected graph that ends up tying most projects together in some way. Back when CIA was smaller, we could visualize this graph. Nowadays it just takes way too much CPU time.
 
[LW] How have "users" used the data, stats, and events published from the CIA Notification server?
 
Many people link to their author page from a personal homepage or blog, and more and more projects are including links from their web site to their CIA stats page. A few projects are including CIA stats directly on their web site using RSS aggregators. CIA does provide a low-level XML feed with more detailed stats, and there's an XML-RPC interface that gives you easy programmatic access to all the data used to generate the web site. I don't think anyone is actually making use of this yet, but it's hard to expect people to use interfaces I haven't got around to documenting yet.
 
The coolest practical use of CIA I've seen recently was on the Planet Gentoo site. Since Gentoo contributors have the same username everywhere, they could link every blog post directly to that user's CIA stats page.
 
I expect people will find even more diverse ways to use CIA once I make the details of the XML-RPC interface well-known. I'm also really hoping that publish-subscribe becomes more common, as polling the RSS feeds really generates a huge amount of web traffic.
 
[LW] The community aspect of CIA is interesting to learn of, with people making a big play of their own 'commit' status.  Do you see the need to feed peoples egos is a big part of what CIA can deliver?
 
[MT] Definitely. Commit reporting has been done before, but one of the things that makes CIA really unique is that it brings projects together into a larger community. Anybody's CVS to RSS gateway or commit mailing list can be useful to developers in pretty much the same way, but CIA has a way of introducing a bit of healthy competition. People love seeing their work Show up in public IRC channels. It seems less like they're locked in a closet pounding away at code in isolation, and more like they're doing something interactive that everyone else can see. CIA lets everyone know when you're making progress and gives you a virtual pat on the back for it. I know many people have trouble developing when CIA isn't around, since it just isn't quite as much fun.
 
[LW] CIA is watching itself, which is pretty cool.  Have you had much help from the community development wise?
 
User contributions have been very important to the CIA client scripts, and in defining the XML message format. I wrote the client script for Subversion repositories, but all other clients were contributed by users. On the server side though, I've been mostly alone. The server's codebase is pretty clean and well-organized, but it's big and largely undocumented. The server is tricky to set up, and it has a steep learning curve, so it has much less appeal for random hacking than the client scripts.
 
 [LW] Fundamentally CIA requires a small piece of script to be installed in the CVS/SVN servers to alert it when something changes.  How do you go about asking for support from say SourceForge based projects? Have they been supportive?
 
[MT] CIA has spread really well just by word of mouth. Generally a project admin or enthusiast hears about CIA, sets it up, then the first news I get about it is a request for a metadata key or IRC bot. When the project was brand new Mike, Lalo, and I advertised it to a few other projects and set up scripts to scrape commits off of email lists. There are still a few projects that are connected to CIA via mailing lists, but the vast majority of projects were set up without any direct encouragement from us.
 
[LW] What are the longer term plans of CIA?  Where do you see it heading?
 
There are some loose ends that I'd like to tie up, like web-based registration for IRC bots and metadata keys. I'm sure there are more bugfixes to be had. That's all just polishing what's already there I don't see CIA changing a whole lot, just becoming easier to use, more robust, and more scalable. CIA already has a lot of feature bloat for what it is, really. The biggest change I see happening in CIA's future is making it easier for people to set up their own CIA servers in such a way that the load can be shared across many machines but the large-scale relationships between people and their work can be maintained.
 

Micah Dowty: Bio Details

Micah started tinkering with electronics and software at a very early age thanks to having an engineer for a father and a teacher for a mother. He finds himself learning more from his own personal projects than from school, and he has also contributed to a handful of larger open source projects including BZFlag, Crystal Space, and the Linux kernel.
 
 
  
 

More Stories By Alan Williamson

Alan Williamson is widely recognized as an early expert on Cloud Computing, he is Co-Founder of aw2.0 Ltd, a software company specializing in deploying software solutions within Cloud networks. Alan is a Sun Java Champion and creator of OpenBlueDragon (an open source Java CFML runtime engine). With many books, articles and speaking engagements under his belt, Alan likes to talk passionately about what can be done TODAY and not get caught up in the marketing hype of TOMORROW. Follow his blog, http://alan.blog-city.com/ or e-mail him at cloud(at)alanwilliamson.org.

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


@ThingsExpo Stories
"Akvelon is a software development company and we also provide consultancy services to folks who are looking to scale or accelerate their engineering roadmaps," explained Jeremiah Mothersell, Marketing Manager at Akvelon, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
In his session at 21st Cloud Expo, Carl J. Levine, Senior Technical Evangelist for NS1, will objectively discuss how DNS is used to solve Digital Transformation challenges in large SaaS applications, CDNs, AdTech platforms, and other demanding use cases. Carl J. Levine is the Senior Technical Evangelist for NS1. A veteran of the Internet Infrastructure space, he has over a decade of experience with startups, networking protocols and Internet infrastructure, combined with the unique ability to it...
"Space Monkey by Vivent Smart Home is a product that is a distributed cloud-based edge storage network. Vivent Smart Home, our parent company, is a smart home provider that places a lot of hard drives across homes in North America," explained JT Olds, Director of Engineering, and Brandon Crowfeather, Product Manager, at Vivint Smart Home, in this SYS-CON.tv interview at @ThingsExpo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
Gemini is Yahoo’s native and search advertising platform. To ensure the quality of a complex distributed system that spans multiple products and components and across various desktop websites and mobile app and web experiences – both Yahoo owned and operated and third-party syndication (supply), with complex interaction with more than a billion users and numerous advertisers globally (demand) – it becomes imperative to automate a set of end-to-end tests 24x7 to detect bugs and regression. In th...
"IBM is really all in on blockchain. We take a look at sort of the history of blockchain ledger technologies. It started out with bitcoin, Ethereum, and IBM evaluated these particular blockchain technologies and found they were anonymous and permissionless and that many companies were looking for permissioned blockchain," stated René Bostic, Technical VP of the IBM Cloud Unit in North America, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Conventi...
Large industrial manufacturing organizations are adopting the agile principles of cloud software companies. The industrial manufacturing development process has not scaled over time. Now that design CAD teams are geographically distributed, centralizing their work is key. With large multi-gigabyte projects, outdated tools have stifled industrial team agility, time-to-market milestones, and impacted P&L stakeholders.
"Cloud Academy is an enterprise training platform for the cloud, specifically public clouds. We offer guided learning experiences on AWS, Azure, Google Cloud and all the surrounding methodologies and technologies that you need to know and your teams need to know in order to leverage the full benefits of the cloud," explained Alex Brower, VP of Marketing at Cloud Academy, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clar...
Widespread fragmentation is stalling the growth of the IIoT and making it difficult for partners to work together. The number of software platforms, apps, hardware and connectivity standards is creating paralysis among businesses that are afraid of being locked into a solution. EdgeX Foundry is unifying the community around a common IoT edge framework and an ecosystem of interoperable components.
"MobiDev is a software development company and we do complex, custom software development for everybody from entrepreneurs to large enterprises," explained Alan Winters, U.S. Head of Business Development at MobiDev, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
Coca-Cola’s Google powered digital signage system lays the groundwork for a more valuable connection between Coke and its customers. Digital signs pair software with high-resolution displays so that a message can be changed instantly based on what the operator wants to communicate or sell. In their Day 3 Keynote at 21st Cloud Expo, Greg Chambers, Global Group Director, Digital Innovation, Coca-Cola, and Vidya Nagarajan, a Senior Product Manager at Google, discussed how from store operations and ...
"There's plenty of bandwidth out there but it's never in the right place. So what Cedexis does is uses data to work out the best pathways to get data from the origin to the person who wants to get it," explained Simon Jones, Evangelist and Head of Marketing at Cedexis, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
SYS-CON Events announced today that CrowdReviews.com has been named “Media Sponsor” of SYS-CON's 22nd International Cloud Expo, which will take place on June 5–7, 2018, at the Javits Center in New York City, NY. CrowdReviews.com is a transparent online platform for determining which products and services are the best based on the opinion of the crowd. The crowd consists of Internet users that have experienced products and services first-hand and have an interest in letting other potential buye...
SYS-CON Events announced today that Telecom Reseller has been named “Media Sponsor” of SYS-CON's 22nd International Cloud Expo, which will take place on June 5-7, 2018, at the Javits Center in New York, NY. Telecom Reseller reports on Unified Communications, UCaaS, BPaaS for enterprise and SMBs. They report extensively on both customer premises based solutions such as IP-PBX as well as cloud based and hosted platforms.
It is of utmost importance for the future success of WebRTC to ensure that interoperability is operational between web browsers and any WebRTC-compliant client. To be guaranteed as operational and effective, interoperability must be tested extensively by establishing WebRTC data and media connections between different web browsers running on different devices and operating systems. In his session at WebRTC Summit at @ThingsExpo, Dr. Alex Gouaillard, CEO and Founder of CoSMo Software, presented ...
WebRTC is great technology to build your own communication tools. It will be even more exciting experience it with advanced devices, such as a 360 Camera, 360 microphone, and a depth sensor camera. In his session at @ThingsExpo, Masashi Ganeko, a manager at INFOCOM Corporation, introduced two experimental projects from his team and what they learned from them. "Shotoku Tamago" uses the robot audition software HARK to track speakers in 360 video of a remote party. "Virtual Teleport" uses a multip...
A strange thing is happening along the way to the Internet of Things, namely far too many devices to work with and manage. It has become clear that we'll need much higher efficiency user experiences that can allow us to more easily and scalably work with the thousands of devices that will soon be in each of our lives. Enter the conversational interface revolution, combining bots we can literally talk with, gesture to, and even direct with our thoughts, with embedded artificial intelligence, whic...
SYS-CON Events announced today that Evatronix will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Evatronix SA offers comprehensive solutions in the design and implementation of electronic systems, in CAD / CAM deployment, and also is a designer and manufacturer of advanced 3D scanners for professional applications.
Leading companies, from the Global Fortune 500 to the smallest companies, are adopting hybrid cloud as the path to business advantage. Hybrid cloud depends on cloud services and on-premises infrastructure working in unison. Successful implementations require new levels of data mobility, enabled by an automated and seamless flow across on-premises and cloud resources. In his general session at 21st Cloud Expo, Greg Tevis, an IBM Storage Software Technical Strategist and Customer Solution Architec...
To get the most out of their data, successful companies are not focusing on queries and data lakes, they are actively integrating analytics into their operations with a data-first application development approach. Real-time adjustments to improve revenues, reduce costs, or mitigate risk rely on applications that minimize latency on a variety of data sources. In his session at @BigDataExpo, Jack Norris, Senior Vice President, Data and Applications at MapR Technologies, reviewed best practices to ...
An increasing number of companies are creating products that combine data with analytical capabilities. Running interactive queries on Big Data requires complex architectures to store and query data effectively, typically involving data streams, an choosing efficient file format/database and multiple independent systems that are tied together through custom-engineered pipelines. In his session at @BigDataExpo at @ThingsExpo, Tomer Levi, a senior software engineer at Intel’s Advanced Analytics gr...