It’s been about a year since I started using Go. I’ve written a lot of code with it, including an entire micro-service and a contribution to glide. I want to emphasize that, for the most part, it’s been pretty good. But about once a month, my eye starts twitching and I get the urge to start making a head-to-desk motion. The reasons for this can be pretty simply demonstrated. Since I seem to have this conversation on a regular basis, I thought I’d record my thoughts so I can just simply reference this the next time it comes up.
A Simple Data Structure
I need a data structure. It has the following requirements:
- It must store the name of an animal and an associated struct containing some information regarding the animal.
- It must have O(1) lookup time.
- I must be able to iterate over it’s elements in some predefined order.
- It must be immutable¹.
At first glance, we might be tempted to simply use a map like so:
type AnimalMap map[string]Animal
This is kinda nice. In fact, it satisfies the first two requirements of our data structure. But since the very early days of Go, the order objects are stored in a map is random (for good reason). So I can’t do something simple like insert all the elements into the map in the same order over which I’d like them to be iterated.
Alright, no big deal. We can get around this by just writing a little bit more code, using an array to keep track of order.
type AnimalMap struct {
Mapping map[string]Animal
Order []string
}
Now whenever we want to iterate over the map, we need to iterate over the Order
slice instead, using the strings we pull off as keys into the map. Not the greatest, but it works.
What About Immutability?
This is where things get really hairy. Go doesn’t have the concept of runtime constants. If we want things to be immutable after we create them, we’re going to have to completely, and I mean completely, encapsulate both the slice and the map in our struct. This is because maps and slices have pass-by-reference semantics. In other words, if at any point I give you a reference to them, you’ll be able to modify them. The best I can come up with is now this explosion of code:
type Animals interface {
Len() int
At(i int) string
}
type animals []string
func (a animals) Len() int {return len(a)}
func (a animals) At(i int) string {return a[i]}
type AnimalMap struct {
mapping map[string]Animal
order animals
}
func NewMap(order []string, mapping map[string]Animal) AnimalMap {
newMap := AnimalMap{
mapping: make(map[string]Animal, len(mapping)),
order: make(animals, len(order)),
}
for i, name := range order {
newMap.mapping[name] = mapping[name]
newMap.order[i] = name
}
return newMap
}
func (am AnimalMap) Order() Animals {return am.order}
func (am AnimalMap) Animal(name string) Animal {return am.mapping[name]}
What just happened? That’s an infuriating amount of code to write. You know how much code I’d have to write if this were C++, C#, or Java? None. They all have reusable notions of an immutable, ordered map. Yet in Go, every time I need one I have to write 30 lines of code. I feel like I had to fight to Go every step of the way here. I feel sad.
Conclusion
Go has some pretty neat concepts. I love that it enforces composition over inheritance. Goroutines are pretty nifty. I’ve been pretty satisfied with it overall. But time and again, I find myself solving the same problems over and over when using Go. I find myself completely blocked, with no way to write DRY code².
I’d love to see the Go community address issues like this. IMHO generics would solve all of the issues above, but I understand that’s somewhat of a sore topic for the Go community. Perhaps something like a simple form of codegen for common data structures and algorithms?
Kurtis Nusbaum is a Mobile and Backend Developer at Uber. He’s an avid long distance runner and diversity advocate. Here's his mailing list. Here's his Twitter, and LinkedIn. Here’s his github. If Kurtis seems like the kind of guy with whom you’d like to work, shoot him an e-mail at kcommiter@gmail.com.
¹Here's why you might want immutability
²For instance, try writing a reusable exponential backoff algorithm or a reusable binary tree. Oh, and you can’t use interface{}. We’re programming in a statically typed language for a reason.
To solve the iterate-in-some-order-but-maintain-immutability problem, how about something like this:
...and then just:
I hadn't thought about this. This is interesting.
That said, I think the boiler plate of always having to wrap my code in a closure is a little frustrating. The example provided here is not to bad, but I think I could envision code that's more complex and this being a little more frustrating to do. Not only that, but I have to wrap each new piece of iteration code with a closure.
What do you think?
I thought about this a little more and I really like your solution. While it adds closure boiler plate at the call site, it also removes the need for the caller to write their own loop. So it's actually a wash in that sense. But it removes all the stuff I had to add for immutability. I actually really like this. My hats off to you.
That said, it's still 10 lines of code I have to write every time I want a data structure like this. Which is an order of magnitude larger than other languages.
I wish I understood why immutable structures are so important, unless you are wanting a functional programming paradigm, which leaves only Haskell or Scala. Write me something in another language that I can't mutate.
IMHO if you want something immutable, don't mutate it.
"... don't mutate it": But who verifies you didn't? Your tooling should. Think about tests, formatting and linting. People could just write perfect code and not use these tools either ;)
Immutability leads to better structured code if people think about which values actually change, which not and how they depend on each other.
Russ Cox has also outline why he thinks immutability is a good idea in his 2017 resolutions: research.swtch.com/go2017
(he also says he wants to a get a better understanding of generics :)
From the post: softwareengineering.stackexchange....
I think even these discussions miss a key point. I can't tell you how many times I've had, in Java, someone just throw "final" on their class because they thought nobody should extend it. I've had to copy entire classes because I couldn't use something from a library more times than I can count just for this reason.
Even "String" in java is final and immutable. I want mutable strings, for all the reasons mentioned in many of these posts -- much faster, easier to deal with, etc. Because of someone's idea that all strings should be immutable, I end out constructing tons of objects that I don't need, and compiler writers spend thousands of hours improving the optimizer to avoid these allocations, and programmers resorting to libraries with buffers and such in order to add simple strings together, all because someone thought string should be immutable.
"For instance, try writing a matrix/matrix multiply. Oh, and you can't use numerical types. We're writing in a language with string support for a reason."
It's not an exact analogy, but it seems disingenuous to include an arbitrary handicap and then complain that something is difficult.
I would argue that the handicap is not arbitrary. As I explained in the sentence after, using interface{} would cause the loss of static type checking which is incredibly valuable.
While not perfect, I talk about using code generation for problems like this here: dev.to/joncalhoun/using-code-gener...
And then there are a few pretty solid generic generation libs out there if you find them more useful. github.com/cheekybits/genny being one of them.
Not as nice as generics but they help.