version control systems
overviews, discussion & transitions
- “Cutting Edge Revision Control” circa 2007 > http://bryan-murdock.blogspot.com/2007/03/cutting-edge-revision-control.html
- a comparison of available free SCM (source code management) systems http://better-scm.berlios.de/
- .. and a guide http://zooko.com/revision_control_quick_ref.html
- an email from jeff rose talking about several systems, from oktober 07, is at the bottom of this page.
- distributed version control systems → DVCS
- using version control, a guide for the lirec project version control guide
- from SVN to Hg > http://www.python.org/dev/peps/pep-0374/
CVS
standard, widely used 'legacy system' with some serious scalabity + flexibilty problems. has client software for unix-likes, win32 + macOS8/9/X http://www.cvshome.org
Note that the Mac clients mentioned here are all screwy. Go To http://sourceforge.net/projects/maccvspro to get an up to date and (hopefully) usable version. Instructions for setting up are in the TechSpec (section 12, page 34 of the current version dec 2002). Note especially the requirement for CarbonLib. (And MacOS 8.6 or higher). The URL in the TechSpec is almost correct - the current version is 1.6, so use http://download.info.apple.com/Apple_Support_Area/Apple_Software_Updates/English-North_American/Macintosh/System/Other_System/CarbonLib_1.6.smi.bin
multiplatform effort to build a gui for CVS → http://cvsgui.sourceforge.net/
see: Using CVS
darcs
distributed, patchset based system, easy to setup + configure, not too many weird filenames (was arch deliberately written to stresstest filesystem syntax?) not dependent on massive centralisaed server setup (apache/dav/neon/db/etc+). win32/unix compliable (if theres a haskell compiler around). cvs conversion scripts. how well does it handle binaries?
see: Using Darcs
subversion
attempts to deal with some of the shortcomming of cvs. http://subversion.tigris.org/
- All current CVS features
- Directories, renames, and file meta-data are versioned.
- Symbolic links, etc, are supported
- Commits are truly atomic.
- Internationalization
- Progressive multi-lingual support
- Branching and tagging are cheap (constant time) operations
- Repeated merges are handled gracefully
- Support for plug-in client side diff programs
- Natively client/server
- Client/server protocol sends diffs in both directions
- Costs are proportional to change size, not project size
reading
- “the book” http://svnbook.red-bean.com/
drawbacks
- stapling extra legs on a dog to make an octopus
arch
“arch is a revision control system with features that are ideal for free software and open source projects characterized by widely distributed development, concurrent support of multiple releases, and substantial amounts of development on branches. It is intended to replace CVS and corrects many mis-features of that system.” http://www.regexps.com/#arch Some quick highlights of the feature list are:
- distributed databases – each hacker or group can host their own branches. There's a global (world wide) name-space for lines of development and revisions. Branches can be formed from any repository to any other and merge operations can span repository boundaries without needing to actually duplicate the full contents of a repository at each site.
- fancy merging – `arch' has support for various styles of history-sensitive branch merging. The way branches and patch-sets interact with distributed repositories makes it practical to distribute the responsibilities for patch-review and merging.
- renames handled – of course file and directory renames are handled accurately. So are symbolic links and file permissions.
- unobtrusive operation – `arch' is designed to stay out your way while making changes and rearranging files. It is designed to have a clean and self-documenting command-line interface having the finest characteristics of good Unix tools.
- web interface – `arch' includes web tools for browsing past revisions.
- revision library – `arch' includes tools for building a space efficient library of all past revisions, represented as ordinary file system trees.
- very fast– when configured correctly, `arch' is quite fast.
see: ARCH for more info
aegis
“Aegis is a transaction-based software configuration management system. It provides a framework within which a team of developers may work on many changes to a program independently, and Aegis coordinates integrating these changes back into the master source of the program, with as little disruption as possible.” http://aegis.sourceforge.net/
mercurial / Hg
another DVCS
git
comments
despite its problems, cvs is in widespead production use, with clients on all major platforms. subversion + arch, currently only run on posix based systems (inlcuding gnus, bsds + osX). darcs looks interesting (runs on posix and win32)
look further into the possibility of integrating darcs into the OsX filesystem workflow using a combination of folder actions and or applescript/platypus/cocoadialog.. .
http://keithp.com/blog/Repository_Formats_Matter/
A nice and interesting email from 24 Okt 2007 on the PD -dev mailing list:
From: rosejn at gmail.com Subject: Re: [PD-dev] SVN? Date: 24 October 2007 2:39:19 PM To: pd-dev@iem.at There are a couple key factors in this decision that seem to be getting confused with each other. I'm new to PD, but I've been using Git for over a year now with my own repositories. In various consulting projects over the last few years I've developed with CVS, SVN, Mercurial, Darcs, Arch and Bazaar-ng. There are a couple facets to the decision. 1) Distributed Development The first, and I think most important, is the development model. Distributed development supports a different kind of collaboration, that I think is showing itself to be quite advantageous for open source software. It lets groups of people organize themselves, get a piece of code working, and then it can be shared with the world. Permissions and management of a central repository is no longer an issue. It doesn't exist. Instead Hans-Christoph, for example, would manage an official pd-extended branch that had the fully tested features, while everyone else could still share the latest and greatest stuff without having a centralized bottle-neck. 2) Personal development advantages In git you have the whole repository locally. This takes more storage space, which is incredibly cheap, but saves LOTS of time. SVN can never merge as well as GIT because you don't have past versions of the repository to merge against. Additionally, with git you get local versioning, local branching, and tons of distributed back-ups of the repository. I'm always working on the train, for example, and with Git I can maintain a much better log of my development because I commit all the time without needing internet access. Branches in git, even for huge projects, are basically free. In space and time. This changes the way you develop. You can make a branch for any little idea you want to mess with or any tricky feature you are working on. It's quite easy to then merge these branches, share them with friends, update them from other's repositories etc. This is a level of power and control that you can not get with svn. 3) User Interface Git was originally designed to function as a back-end suite that would support easy to use front-end utilities. That has changed over time, and it now includes a nice set of commands and tools that make it work just like you would expect. For sure Git doesn't have the GUI plugins like SVN or CVS and it will take some learning, but I think the benefits in the long term will far, far outweight the initial investment. There is a Tk interface, GitK, which is incredibly useful to visually look at the history of a repository, merges, branches etc. There is also git-web, which lets you view the repository online. Git also lets you publish a repository just by copying a directory to an http accessible location. This means anyone can share their ideas, features, abstractions etc, without setting up a server, getting permission for a centralized location or anything. Also relevant to the user experience is speed. Git blows away all the other systems in terms of speed. It does everything faster, and you really notice this because it changes what you do. In some of the other systems, like Bazaar-ng, committing, pulling and pushing took so long I wouldn't do it that often, but with git it's all so cheap I do it every time I get a new unit test to pass. 4) Technical Git writes the repository into a highly compressed format, and then it does not mess with the files. Append only is the standard operation, except for when you occasionally compress the whole thing, in which case it can verify sanity. It is also much cleaner to work with in terms of permissions and access, just because of the whole usage model. I've had many experiences with subversion where the repository had to be recovered because of permissions issues and/or corruption. This is very bad, especially since it is a centralized server that everyone is counting on. It takes a root user to go in and run "svnadmin recover", which in my opinion is a command that shouldn't need to exist in something as important as a source repository. Keith Packard who runs the X.org project wrote a very good post detailing his research and opinions into source control. It's a bit old now, but worth the read: http://keithp.com/blog/Repository_Formats_Matter/index.html Sorry for the long post, but this is something I have dealt with a lot recently. In my mind distributed versioning is the only proper way to run a modern open source project. PD is a great piece of software with a seemingly cool and diverse group of developers, which seems perfect for a decentralized model. Either way, I think the decision should be made first between a centralized or distributed versioning system, and then the decision can be simplified to figuring out compatibility, usability etc. It's great news that Git now runs on windows though, because I think it is far better than the others. To quote one of Hans-Christoph's recent signature lines: Mistrust authority - promote decentralization. - the hacker ethic