Virtual Surreality

It's too real to be true

Object/Relational Mappings (ORM’s) are in the wrong place in the architecture.

An application should have minimal impedance mismatch with the persistence of its own data. External or ancillary systems should bear the cost of mapping between paradigms. If you are writing an application with the concepts modelled in an object-oriented fashion, then the state of those objects should be stored in an object-oriented fashion. Mapping from the object-oriented structure to a relational structure should be done where and when it’s needed. If you want to access application data in a relational way for reports, do the mapping for the report.

There has been a low uptake of OODBMS’s. Why is outside the scope of this post. However, there are other, older (albeit feeling newer) data storage techniques available that more readily persist an object graph than a relational database. A flurry of open-source implementations have come along. You may have heard of MongoDB, neo4j, Redis, Hadoop, CouchDB, Memcached, and Raven DB (a native .NET document database). These are all under active development, and almost all have been proven on many large commercial projects (Raven was being developed by Oren while he was at speakerconf 2010!).

For an introductory look at evaluating the landscape, check out Sarah Mei’s RailsConf presentation slides.

One of the projects I’ve worked on at Hashrocket involved a large MongoDB database. The reasons for the choice for MongoDB were clear:

  • the way the users handled most of the data more naturally fitted an embedded document model;
  • the scale and speed required were far greater than relational databases could achieve; and
  • the need for ACID transactions was almost nil

Additionally, the data was mostly read, rather than written (although MongoDB writes blazingly fast too). I’m hoping to address the equivocations in those reasons in a subsequent post.

MongoDB uses a binary form of JSON. As JSON is a serialization format for object state, it’s a good candidate for persisting an object graph. JavaScript, remember, uses composition rather than inheritance, so there is also a tendency to “embed” document instances within other document instances. So, for MongoDB, most of the object graph is stored a tree structure. Cyclic graphs and associated objects (those that aren’t aggregated with composition) are handled with references to identifiers.

As a consequence of using Rails and MongoDB, the team looked to various ways of manipulating the JSON structures from Ruby: directly through the driver, MongoMapper, MongoDoc, and Mongoid. The rationales behind these libraries and their different and interesting approaches are the subject of additional blogs (hopefully jointly with Nunemaker, Hill, and Jordan respectively). The needs and makeup of the project we were on fell into the lap of mongoid to solve.

DBAs want to interact directly with the database. Because of this “new-fangled” approach to data storage that has been around since before SQL, some are unwilling or unable to learn the query language to interact directly with the database. So, to make their world a less scary place and let them retire without the stress of intellectual endeavour, I built a simple, declarative language where the mapping from the tree structure of MongoDB to the tuple space in an RDBMS can be scripted.

Durran Jordan (author of Mongoid) and I paired on “squealer” (and on emptying bottles) in our evenings while he was stationed in the Chicago office of Hashrocket. The name came from “SQL” (which some of us still pronounce “squeal” rather than “sequel”) and the facetious notion that it turned the data into a pig.

We decided to focus on mySQL as the target and MongoDB as the source as these are the technologies we were using. Upcoming targets will include PostgreSQL, SQL Server, Oracle. Upcoming sources will include Redis and CouchDB.

UPDATE: squealer 2.2 now supports PostgreSQL!

Matt Yoho provided multiple insights into implementing the Ruby DSL, and Bernerd Schaefer helped me reduce the DSL syntax and provide sensible defaults (as well as the righteous progress bar). Bernerd then struck on the idea of reflecting on the Mongoid declarations in the Rails domain model classes to generate an initial squealer script and SQL DDL to build the target SQL database. He then set about writing it overnight and we tweaked it over lunch. We called this tool “skewer”, because that’s what you do to something to make it squeal ;-) . It’s part of the squealer RubyGem.

Next time: how to use skewer and squealer.

After 4.5 years at ThoughtWorks, I’ve put down the purple elephant and donned my spacesuit to join Hashrocket and open the Chicago office.

Hashrocket is the world’s best Ruby on Rails shop — a world-class team of spectacular developers, lead by my former and now current colleague Obie Fernandez.

Our Chicago office has been set up to help draw in more talent to meet our growing client needs. Some devs would prefer to live in a big city and Chicago has a strong Ruby community.

If you’re a Rubyist in Chicago, you can look forward to Hashrocket soon hosting some of the user groups and other events. We will be open at 661 W Lake St on Feb 1, 2010.

Announcing PreparedEval, prepared statements for generating JavaScript (and the like) from Java. It prepares Java strings for JavaScript evaluation, similar to SQL prepared statements. Saves you from quote escape nightmares! Great for Selenium tests and Rhino JavaScript.

As a follow-up to my post on escaped, quoted strings, Taking the idea of using a different encoding scheme further (and not being super happy about the need to pump the string through unescape()) I thought about interpolation and wondered about prepared statements in SQL. Some quick research turned nothing up, but while chatting with Ola Bini he also mentioned prepared statements, so this convinced me to knock something together that might be useful.

More information is at http://wiki.github.com/delitescere/PreparedEval.

There’s no runtime dependencies so it’s nice and tiny if you just want the JAR.

Let me know if you use if to help with CSS / XPath selectors for Selenium or if you have any suggestions on improving the API.

When you have more backslashes than words in the strings you are constructing, perhaps it’s time to look at another way of handling special characters like quotes.

The problem

You’re generating JavaScript code from another language with C-like quoting and escaping syntax (e.g. Java).

As part of that code generation, you’re substituting strings that have both double quotes (“) and single quotes (‘). My team’s actual problem stems from using XPath locators which include quoted text and sending that to Selenium so we can obtain the particular element that contains the text*.

Let’s say we want to pop up an alert within which are some double quotes:

String original = "Hello \"stranger\", how are you?";
String quoted = original.replaceAll("\"", "\\\"");
String script = "alert(\"" + quoted + "\")";
eval(script);

We then expect the browser to have this JavaScript code to process:

alert("Hello \"stranger\", how are you?")

and when then expect the alert to display the string:

Hello “stranger”, how are you?

But why doesn’t it work?
continue reading…

I’m visiting our India operation for a few months, mostly at our Pune office. I’m trying my hand at various coaching activities on a number of fronts, including working with some of our technical leads, aspiring architects, business development team, and recruitment.

I’ve always been a fan of our teams in India and China. In the early days of the ThoughtWorks Sydney office, to help fill the rapidly growing demand pipeline, we were able to bring people in from TW offices around the world, including some from India (Nitesh rocks!). We also had to distribute some delivery work to India (and later, China too) on some large projects. This was an eye-opening time for us in learning how to make Agile software development work across teams in different places and timezones. (People like Ben Hogan, Naresh Jain, Shyam Mohan, and Jane Robarts have been able to share some of that experience at conferences.)

It wasn’t the reduced rates that these projects were able to offer to clients that appealed to them – it was the large capacity of ThoughtWorks talent available that we simply couldn’t fill quickly enough locally. ThoughtWorks and clients alike insisted on top grade delivery team members, no matter where they happened to be located during any given iteration (we rotated people through the locations quite a bit to engender better collaboration, shared understanding, and teamwork).

Being a senior member of the company for four years, I had always felt a little incomplete in not having visited any offshore teams – based on the stories from others who had visited, I was missing out. I’m pleased now I’m here and I can experience for myself the atmosphere, attitude and aptitude of the Pune office and I’ll get to meet everyone from Chennai and Bangalore at the ThoughtWorks India Away Day in a few weeks.

It’s also pleasing that we have evolved the offshore capability to the point it is a compelling set of offerings that really appeal to companies looking for top quality, technical innovation, agile software delivery, and commitment to results within leaner budgets. http://offshore.thoughtworks.com/

I’m very excited to be here right now. Amanda is knocking out her “F# In Action” book from Manning as fast as an autorickshaw darts between pedestrians, I’m working with some awesome new and old faces, and the dal makhani is delicious!

However, I’m not having much luck getting people to call me “RoganJoshG”… ;-)

A recent public panel discussion was held in the ThoughtWorks Chicago office in which Martin Fowler, Rebecca Parsons, Neal Ford, Ian Cartwright, Erik Doernenburg, Ola Bini, and myself discussed the four topics Cloud, Language Workbenches, Developer Certification, and Polyglot Programming.

The recordings have been released at http://securerespond.com/thoughtworks/tab/

There’s also links to a couple of podcasts recorded the next day by Neal Ford (one of which is also with me).

Should it be assertEquals(expected, actual) or assertEquals(actual, expected)?

This discussion came about because at the Chicago Code Camp, Jim Suchy from 8th light was demonstrating TDD in JavaScript by building a simple unit testing library. One of the methods he created was the typical assertEquals. In the code on github he had a signature of assertEquals(expected, actual) while during the demo he swapped the order, and I pointed out that the original was what I would have anticipated. He commented that there was an interesting discussion about it at his office a few days before. I’m interested to hear more about it, and I tweeted that I was inclined to get a debate going about it just to see what the different rationales were. This post is a collation of thoughts and opinions to date.
continue reading…

Today’s blog is sponsored by the word “awesome”.

I’ve just come back from a trip to the awesome QCon London 2009 and the (thankfully) awesome speakerconf 09 (more of both in blogs soon, promise!)

Coming at the beginning of May, though, is the most awesome JAOO Australia 2009 in both Sydney and Brisbane. Last year’s inaugural event was awesome and this year’s looks to be its match.

The awesome Dave Thomas asked me to speak there on “something Enterprisey”, so I’m putting together notes and experiences on applying Agile practices to enterprise architectures. It’s easy to see that current economic drivers make it imperative for a firm’s technology leaders to do more to embrace rapid business change within IT. The talk will draw on ThoughtWorks projects across the globe, including those I participated in both in the US and Australia. It’s Neo Architectural. It’s for Modern Architects.

Also, my very talented and awesome wife, Amanda Laucher, will be presenting on DSLs and language-oriented programming, providing some insight on when to make one and some samples in C#, F# and MGrammar. Don’t worry if you only know Java or Ruby, you’ll get plenty out of the talk if you want to learn more about DSLs.

Amanda will also be teaming up with Joel Pobar to provide an Introduction to Functional Programming tutorial, with a focus on the awesome F# language. If you want to know about partial applications, dynamic pattern matching, idiomatic units of measure, and asynchronous workflows on top of the usual functional programming concepts, then this talk is for you. If someone at work is saying “we need to exploit concurrency” then this talk is for you. Again, if you’re coming from Java land that’s fine, there’s enough to get you looking at Scala or the awesome Clojure languages too.

CodeMash

2 comments

I’ve been given the privilege of speaking at CodeMash 2009 in Sandusky, Ohio this coming January.

The presentation and discussion will cover the learning from implementations of Guerilla SOA and will include code and demonstration based on the WCF. It may change some minds about the default approach to the WCF programming model and open some eyes on how to build a truly message-oriented service oriented ecosystem.

Looking forward to a conference that comes highly recommended.

The submitted abstract

Title: Guerilla SOA for WCF
Track: Arch & Design
Level: Intermediate

Description:
This talk goes over the fundamentals of Guerilla SOA (a concept conceived in ThoughtWorks and championed by Dr Jim Webber) and how they are applied to WCF in both a SOAP and REST context.

Using principles from agile software development into an emergent architecture, particularly for services, the talk looks at an alternative approach to the usual usage of the WCF programming model which provides a true message-oriented approach (rather than the RPC hole almost every WS-* stack drives us into).

Drawing on real large-scale projects, we’ll touch on consumer-driven contracts, sensible XSD, alternate validation techniques, LINQ to XML, XPath, and a convention-based and MVC approach to web service implementation.

If you want to develop Ruby applications that use the services in the Microsoft® .NET Services cloud, then this SDK is for you:

http://www.dotnetservicesruby.com/