This blog post explores references, a mechanism used by the ECMAScript language specification to explain the difference between the following two expressions:
obj.prop() (0, obj.prop)()
Method calls versus function calls
Consider the following object:
var obj = { getThis: function () { "use strict"; return this; }, };
If you call obj.getThis
, you have a method call (this
points to the object in which the method is stored):
> obj.getThis() === obj true
If you store obj.getThis
in a variable and then call it, you are making a function call:
> var func = obj.getThis; > func() undefined
The effect is the same if you use the comma operator. Quick recap – the comma operator works like this:
(expr1, expr2) === expr2
That is, both expressions are evaluated, the result of the whole expression is expr2
.
If you apply the comma operator to obj.getThis
before calling it, you are also making a function call:
> (0,obj.getThis)() undefined
What the first operand is doesn’t matter at all, here, I use 0
, because its short. I’d expect many JavaScript engines to optimize and eliminate the evaluation of the first operand.
However, only using parentheses does not change anything:
> (obj.getThis)() === obj true
So what is going on? The answer has to do with references.
References, a data structure of the ECMAScript spec
References are a data structure that is used internally by the ECMAScript language specification. A reference has three components:
- Base value: is either
undefined
, a primitive value, an object or an environment record.undefined
means that a variable name could not be resolved. Accessed viaGetBase(V)
(given a referenceV
). - Referenced name: is a string or a symbol. Accessed via
GetReferencedName(V)
(given a referenceV
). - Strict reference: flag indicating whether or not the reference was created in strict mode. Accessed via
IsStrictReference(V)
(given a referenceV
).
Examples of JavaScript expressions that produce references:
- Property reference: Evaluating
obj.prop
in strict mode produces the reference(obj, 'prop', true)
. - Identifier reference: Evaluating
foo
in strict mode produces the reference(env, 'foo', true)
.env
is the environment record where the variablefoo
is stored.
The flag for strict mode is necessary, because some operations cause an exception in strict mode, but fail silently in sloppy mode. For example: setting an unresolved variable in sloppy mode creates a global variable, setting it in strict mode throws a ReferenceError
.
These are two operations (of several) for references:
GetValue(V)
ifV
is a value, the result isV
. IfV
is a reference, the result is the value pointed to by the reference. This conversion from reference to referenced value is called dereferencing.PutValue (V, W)
writes the valueW
to the referenceV
.GetThisValue(V)
is only called ifV
is a property reference. For normal references, it returns the base value. For references created viasuper
, it returns the additional component thisValue that they have ((which is needed for super property references)[http://exploringjs.com/es6/ch_classes.html#super-properties]).
References in the example
We are now ready to understand the examples we looked at earlier.
The following expression produces a reference:
obj.getThis
If you function-call this reference ref
then this
is set to GetThisValue(ref)
.
If you wrap obj.getThis
in parentheses, nothing changes, parentheses only syntactically group things, but the don’t influence how something is evaluated. That is, the result of the following expression is still a reference:
(obj.getThis)
If, however, you assign the reference returned by obj.getThis
to a variable, the reference is dereferenced:
var func = obj.getThis;
In other words: what is stored in func
is a function, not a reference. In the language spec, assignment operators use GetValue()
to turn references into values.
The comma operator also dereferences its operands. Consider this expression:
(0, obj.getThis)
The comma operator`) to ensure that the result of each operand is dereferenced if it is a reference.
References and bind()
References only being temporary is also the reason why you need to use bind
if you want to turn a method into a callback (first line):
var log = console.log.bind(console); log('hello');
If you simply did:
var log = console.log;
Then the receiver (this
) would get lost, because console.log
is dereferenced before it is stored in log
.
References in real-world code
Babel uses the comma operator to avoid function calls being transpiled to method calls.
Consider, for example, this ES6 module:
import assert from 'assert'; assert(true);
Babel compiles it to this ES5 code:
'use strict'; function _interopRequireDefault(obj) { return obj && obj.__esModule ? obj : { 'default': obj }; } var _assert = require('assert'); var _assert2 = _interopRequireDefault(_assert); (0, _assert2['default'])(true); // (A)
Why does the ECMAScript language specification use references?
JavaScript engines, which are implementations of the ECMAScript language specification, don’t actually use references. That means that they are a device that helps with writing the spec. To see why, consider that they represent storage locations. Then consider that all of the following operations work with storage locations:
Reading a value:
x obj.prop super.prop obj["prop"]
Calling a function or a method:
x() obj.prop() super.prop() obj["prop"]()
Assigning a value:
x = 123 obj.prop = 123 super.prop = 123 obj["prop"] = 123
Compound assignment operators. For example, the addition assignment operator (
+=
):x += 5 obj.prop += 5 super.prop += 5 obj["prop"] += 5
typeof
:typeof x typeof obj.prop typeof super.prop typeof obj["prop"]
delete
:delete x delete obj.prop delete super.prop delete obj["prop"]
Because each storage location is represented by the same construct, a reference, the specification only needs to describe a single version of each operation instead of several versions (e.g.: delete
for a variable, delete
for a property with a fixed key, delete
for a property with a dynamically computed key, etc.).
Conclusion
You don’t actually see references when you use JavaScript. But there are languages (e.g. Common Lisp) where references are first class values. That enables intriguing applications. You can, for example, implement functions that perform an assignment for you.
No comments:
Post a Comment