DataMatrix 0.8 is finally out

At last, after months of inactivity, I pushed out a new release of DataMatrix. Although the version bump is small (0.8) there are a lot of changes since last releases. The most notable include:

  • Ability to apply functions to elements of the matrix
  • Ability to filter rows by column contents
  • Ability to transpose rows with columns
  • An option to load text files produced by R (which are, by design, broken)
  • Removed the getter for columns, using dictionary-like syntax directly
  • A lot of bug fixes

The download links on the project page have been updated, along with the documentation. Also, there is another change, because from now on the official Git repository is hosted on gitorious.org, and no longer on github, because gitorious (the software) is also free, while github.com’s is not. It’s mainly a philosophical issue (the same that prompted me to switch from twitter to identi.ca).

Also, from today DataMatrix is also officially hosted on the Python Package Index (with the name “datamatrix”), meaning that you can use easy_install to quickly install it.

If you use this module, let me know what you think (including bugs, if you find them).

DataMatrix 0.7 has been released

Finally a new entry! I’ve been extremely busy with other things, that is why I did not have time to write more. One of the main reason is related to an important landmark in my professional career, but I’ll write more about it after January 1st (hint: those who follow my Twitter updates may have already understood).

As a nice way to break the hiatus, I’m releasing a new version of DataMatrix, my implementation of R’s data.frame in Python. Although the version bump is small, there are loads of improvements. First of all, there is proper support for file-like objects, as well as support for appending and inserting both rows and columns. writeMatrix has been substantially improved and now writes files correctly, and I have added (experimental) support for a DataMatrix object that does not require files – EmptyMatrix. Also, there is now proper documentation. Last but not least, unit tests have been added, a good way to watch out for regressions in the code.

Finally, this version marks the entrance of dalloliogm as contributor to the code. He gave quite a number of helpful hints, especially with regards to unit tests.

I’m quite satisfied on how DataMatrix behaves – as a matter of fact I use it extensively on a number of internal projects.

You can grab DataMatrix 0.7 as a source package or as a Windows installer. Comments are welcome.

DataMatrix page up

Ok, ok… my definition of “tomorrow” is not like what most people use, apparently. Although I took quite a while, now there is a static page on DataMatrix. There you will find a summary of wht I wrote in my other blog posts regarding this module. Of course, it will be kept up-to-date should I release a new version.

Aside that, I put a contact form on this blog. It may be useful for reports that aren’t directly related to the posts.

DataMatrix 0.5

At last, since it’s been like ages, I decided to put out a new version of DataMatrix. For those who haven’t seen my previous post, DataMatrix is a Pythonic implementation of R’s data.frame. It enables you to manipulate a text file by columns or rows, to your liking, using a dictionary-like syntax.

In this new version there have been a few improvements and correction to a couple bugs (for example saveMatrix did not really save) and the start (only a stub at the moment) of an append function to add more columns (I’ll also think about a function to add rows).

DataMatrix is licensed under the GNU GPL, version 2 only. You can download the installer (Windows) or the source distribution (Linux and other *nixes). The only requirement is Python 2.5 or later installed on your system.

The README currently is a stub, but you can browse the pydoc generated documentation, which details how to instantiate and use DataMatrix objects (or you can turn to my older post).

Also, since git is the new “cool feature of the day”, DataMatrix is is hosted on github’s repository, and you can grab the source with

git clone git://github.com/cswegger/datamatrix.git

Comments and suggestions are welcome. I’ll be putting a static page on DataMatrix tomorrow, if time permits.