Understanding "Equality and Identity"?

Understanding Equality and Identity

[ SHORT VERSION]

Overview

[…]

Core Concepts

Equality (==)

Equality tests whether two objects have equivalent values. In SuperCollider, the == operator compares the contents or values of objects:

// Basic equality examples
"hello" == "hello"              // true - same content
[1, 2] == [1, 2]               // true - same elements
(a: 1, b: 2) == (a: 1, b: 2)   // true - same key/value pairs

// More complex equality
[\a, \b] == ['a', 'b']         // true - symbols compare equally
[1, [2, 3]] == [1, [2, 3]]     // true - deep equality comparison

Identity (===)

Identity checks if two variables reference exactly the same object in memory:

// Basic identity examples
a = [1, 2];
b = a;        // b references the same array as a
c = [1, 2];   // c is a new array with same content

a === b       // true - same object
a === c       // false - different objects
a == c        // true - but equal content

̶#̶#̶ ̶ ̶-̶-̶-̶M̶a̶t̶h̶e̶m̶a̶t̶i̶c̶a̶l̶ ̶F̶o̶u̶n̶d̶a̶t̶i̶o̶n̶s̶-̶-̶-̶-̶
̶
̶#̶#̶#̶ ̶C̶a̶t̶e̶g̶o̶r̶y̶ ̶T̶h̶e̶o̶r̶y̶ ̶P̶e̶r̶s̶p̶e̶c̶t̶i̶v̶e̶
In category theory, identity has a deeper meaning focused on structure preservation:

̶1̶.̶ ̶̶̶O̶b̶j̶e̶c̶t̶s̶ ̶a̶n̶d̶ ̶M̶o̶r̶p̶h̶i̶s̶m̶s̶̶̶
̶ ̶ ̶ ̶-̶ ̶O̶b̶j̶e̶c̶t̶s̶:̶ ̶A̶b̶s̶t̶r̶a̶c̶t̶ ̶e̶n̶t̶i̶t̶i̶e̶s̶ ̶(̶s̶e̶t̶s̶,̶ ̶t̶y̶p̶e̶s̶,̶ ̶s̶p̶a̶c̶e̶s̶)̶
̶ ̶ ̶ ̶-̶ ̶M̶o̶r̶p̶h̶i̶s̶m̶s̶:̶ ̶S̶t̶r̶u̶c̶t̶u̶r̶e̶-̶p̶r̶e̶s̶e̶r̶v̶i̶n̶g̶ ̶m̶a̶p̶s̶
̶ ̶ ̶ ̶-̶ ̶I̶d̶e̶n̶t̶i̶t̶y̶ ̶m̶o̶r̶p̶h̶i̶s̶m̶:̶ ̶A̶ ̶s̶p̶e̶c̶i̶a̶l̶ ̶t̶r̶a̶n̶s̶f̶o̶r̶m̶a̶t̶i̶o̶n̶ ̶t̶h̶a̶t̶ ̶p̶r̶e̶s̶e̶r̶v̶e̶s̶ ̶s̶t̶r̶u̶c̶t̶u̶r̶e̶
̶
̶2̶.̶ ̶̶̶I̶d̶e̶n̶t̶i̶t̶y̶ ̶F̶u̶n̶c̶t̶i̶o̶n̶̶̶
̶̶̶̶s̶u̶p̶e̶r̶c̶o̶l̶l̶i̶d̶e̶r̶ ̶/̶/̶ ̶M̶a̶t̶h̶e̶m̶a̶t̶i̶c̶a̶l̶ ̶i̶d̶e̶n̶t̶i̶t̶y̶ ̶f̶u̶n̶c̶t̶i̶o̶n̶ ̶f̶ ̶=̶ ̶{̶ ̶|̶x̶|̶ ̶x̶ ̶}̶;̶ ̶ ̶/̶/̶ ̶f̶(̶x̶)̶ ̶=̶ ̶x̶ ̶f̶o̶r̶ ̶a̶l̶l̶ ̶x̶ ̶ ̶/̶/̶ ̶P̶r̶o̶p̶e̶r̶t̶i̶e̶s̶ ̶o̶f̶ ̶i̶d̶e̶n̶t̶i̶t̶y̶ ̶g̶ ̶=̶ ̶{̶ ̶|̶x̶|̶ ̶x̶ ̶+̶ ̶1̶ ̶}̶;̶ ̶x̶ ̶=̶ ̶4̶2̶;̶ ̶ ̶/̶/̶ ̶L̶e̶f̶t̶ ̶a̶n̶d̶ ̶r̶i̶g̶h̶t̶ ̶i̶d̶e̶n̶t̶i̶t̶y̶ ̶f̶.̶(̶g̶.̶(̶x̶)̶)̶ ̶=̶=̶ ̶g̶.̶(̶x̶)̶;̶ ̶ ̶/̶/̶ ̶t̶r̶u̶e̶ ̶-̶ ̶l̶e̶f̶t̶ ̶i̶d̶e̶n̶t̶i̶t̶y̶ ̶g̶.̶(̶f̶.̶(̶x̶)̶)̶ ̶=̶=̶ ̶g̶.̶(̶x̶)̶;̶ ̶ ̶/̶/̶ ̶t̶r̶u̶e̶ ̶-̶ ̶r̶i̶g̶h̶t̶ ̶i̶d̶e̶n̶t̶i̶t̶y̶ ̶̶̶̶
̶

Mathematical vs Programming Identity

The distinction between mathematical and programming identity is crucial:

  1. Mathematical Identity

    • About value preservation and structure
    • Independent of storage or implementation
    • Based on abstract properties
  2. Programming Identity

    • About memory location and object references
    • Implementation-dependent
    • Practical tool for program logic

Programming Language Perspectives

Haskell’s Approach

Haskell represents a pure functional approach to identity:

-- Identity function composition
main :: IO ()
main = do
  print (id (addOne 5))      -- Output: 6
  print ((id . addOne) 5)    -- Output: 6
  print ((addOne . id) 5)    -- Output: 6

-- Memory identity using StableName
data Somebody = Somebody { identifier :: String, age :: Int }

main :: IO ()
main = do
    let entity1 = Somebody "Zephyr" 30
    let entity2 = entity1
    let entity3 = Somebody "Zephyr" 30

    sn1 <- makeStableName entity1
    sn2 <- makeStableName entity2
    sn3 <- makeStableName entity3

    print (sn1 == sn2)  -- True  - same memory
    print (sn1 == sn3)  -- False - different memory

SuperCollider’s Implementation

SuperCollider provides both concepts explicitly:

// Equality for value comparison
Set[1, 2] == Set[2, 1]    // true - same elements
Array[1, 2] == Array[1, 2] // true - same structure

// Identity for reference comparison
a = Set[1, 2];
b = a;
a === b                    // true - same object

Special Cases and Edge Cases

Singletons and Special Objects

// nil - singleton
nil === nil              // true
nil == nil               // true

// Symbols - unique instances
\hello === \hello        // true
'hello' === 'hello'      // true
Symbol("hello") === \hello  // true

// Numbers
1 == 1.0                 // true - equal value
1 === 1.0               // false - different types

Collections and Containers

Different collection types handle equality and identity differently:

// Dictionary vs IdentityDictionary
d = Dictionary.new;
d[1] = "one";
d[1.0];         // "one" - uses equality

i = IdentityDictionary.new;
i[1] = "one";
i[1.0];         // nil - uses identity

// Event behavior
e = (hello: 100);
e[\hello] == e['hello']     // true
e[Symbol("hello")] == e["hello"]  // true

Practical Applications

Pattern Matching

Identity is crucial in pattern matching:

// Pattern matching with symbols
p = Pbind(\degree, Pseq([0, 1, 2], inf));
p = Pbind('degree', Pseq([0, 1, 2], inf));  // equivalent

// Event matching
e = (type: \note, degree: 0);
e[\type] === \note     // true

Server Integration

// Server objects
s = Server.default;
t = Server.default;
s === t              // true - singleton

// Synth nodes
x = Synth(\default);
y = Synth(\default);
x === y              // false - different nodes

Best Practices

  1. When to Use Equality (==)

    • Comparing values or contents
    • Testing structural equivalence
    • Dictionary key lookup
    • Pattern matching
  2. When to Use Identity (===)

    • Ensuring exact same object
    • Performance-critical comparisons
    • IdentityDictionary keys
    • Singleton checking
  3. Performance Considerations

    • Identity comparison is faster
    • Symbol comparisons are efficient
    • Deep equality can be expensive
    • Use IdentitySet/IdentityDictionary when appropriate

Common Pitfalls

Floating Point Issues

// Floating point precision
0.1 + 0.1 + 0.1 == 0.3     // might be false!
(0.1 + 0.1 + 0.1 - 0.3).abs < 1e-10  // better approach

Collection Pitfalls

// Nested collection comparison
a = [[1], [2]];
b = [[1], [2]];
a == b           // true - deep equality
a === b          // false - different objects
a[0] === b[0]    // false - nested arrays differ

Implementation Notes

Hash Methods

Objects implement hash methods for lookups:

// Hash for equality
a = "hello";
b = "hello";
a.hash == b.hash           // true
a.identityHash == b.identityHash  // false

Conclusion

Understanding equality and identity in SuperCollider requires:

  • Recognizing the distinction between mathematical and programming concepts
  • Understanding implementation details and their implications
  • Knowing when to use each comparison type
  • Being aware of special cases and potential pitfalls
4 Likes

This is great! Just a comment that occurred to me regarding the “mathematical” concept of identity.

Firstly, I think the problems people might run into when first encountering identity in programming is more due to their not thinking of identity in formal terms at all… that is, they do not come to programming with a category theory, or even math, background but rather with a more naive, “everyday” understanding of it. I would conjecture that if somebody already knows about category theory, they won’t really struggle with identity in programming, not because the respective conceptions of id in these fields are similar, but because they are use to thinking in more abstract, formal terms and are aware that id is in effect a technical term.

So in a way this could be less theoretically specific here, even to the point of saying something along the lines “identity in programming and mathematics is a technical term, and as such more clearly defined than the notion of “sameness” in everyday language, …”, and then put the category theory terminology in a footnote, and go on with the id function example.

(EDIT: cutting unnecessary thread-derailing bits)

I think the discussion re the id function is good (just the placement of the comment // properties of identity may be revisited, as the g function isn’t an identity, nor is x.)

1 Like

The idea was to have a supplement, another help document with proper definitions.

Do you see that as confusion or material for those wanting to understand the fundamentals?

Because I see a lot of confusion among all users, one usually picks a definition from one language or another, which has no real fundamentals. That’s why mathematics is the only reference.

We can discuss how each programming language design uses those ideas from there. Memory location will force those concepts to change a lot, for example.

It does not say g is an identity function, it means this:

identity function properties:

* Left identity: f(g(x)) = g(x)
* Right identity: g(f(x)) = g(x)

That’s where those concepts were developed:

Category Theory / Type Theory 101:

I am aware, but it may be confusing still. I imagine something like this could help:

// Properties of identity
g = { |x| x + 1 }; // given some arbitrary function g 
x = 42; // and some arbitrary x...

// Left and right identity is defined as follows:
f.(g.(x)) == g.(x);  // true - left identity
g.(f.(x)) == g.(x);  // true - right identity

Also, counterexamples help, but obviously they result in more text…

1 Like

It would also be worth mentioning that hash comparison doesn’t ensure either equality or identity, because of hash collisions.

Two unequal entities may have the same hash, simply because there are more than 2^32 possible entities (hence some of them must share a 32-bit int hash).

If hashes are unequal, then the entities cannot be equal. But if the hashes match, the entities may yet be different.

hjh

1 Like

Can we put it like this?

  • If two objects have different hashes, they are different
  • If two objects have the same hash, they might be identical (but aren’t necessary!)
  • Identity hashes are always unique for different object instances
  • Regular hashes are content-based and can have collisions

Pratical implication:

Hashing is very adequate for:

  • Dictionary/lookup table
  • Quick equality comparisons
  • Caching systems
  • Data integrity checks

Along with your explanation on why collisions can happen.

Can’t watch that video right now, and also I’m not sure if I understand your reply correctly here (what does “That” and “those concepts” refer to?). I assume you mean “Set theory was where those [category theory] concepts were developed.”, correct me if I’m wrong.

Can’t argue with that, but the notion of identity is a bit older than category theory and modern set theory. I’m not saying that it’s “wrong” to include category theory here, but more that in the context of an sc documentation, what matters is the contrast/distinction from understandings of identity that aren’t those operative in sc; and I’m not sure if this needs to be that specific. And this really only concerns the bit about objects and morphisms! For instance, the rest of the draft never reuses the term “morphism”, so that’s a good indication that it isn’t very helpful here.

On the other hand, I’d be very much in favor of including footnotes with links to encyclopedia entries, e.g. Identity (Stanford Encyclopedia of Philosophy) or Category Theory (Stanford Encyclopedia of Philosophy).
It’s just that these are long-standing philosophical issues that we can’t possibly adress usefully within the docs, so we should focus on the more general aspect, which in my opinion is just (for newcomers to programming and/or math) that formal notions of identity aren’t necessarily coextensive with everyday notions of identity.

1 Like

Set Theory was a serious problem on its basis. In computer science, I think we find profound research borrowing ideas from category theory (and I agree that it is overkill). But Set theory is not relevant, I think.

1 Like

(EDIT: cutting unecessary thread-derailing bits)

1 Like

1901 (way before the 1960s, as you mentioned).

1 Like

(EDIT: cutting unecessary thread-derailing bits)

1 Like

Well, we can enter a confusing territory here. Let’s go slow.

Today, Sets kinds of are category theory. And it happens in different ways.

First, small categories are actually “sets.”

Another example is just defining sets using category theory.

http://www.tac.mta.ca/tac/reprints/articles/11/tr11.pdf

So, yes, it can be used in many simplified contexts. But there is no benefit here.


I do not understand why you are discussing this.

Primarily because it’s interesting (as you too seem to think) but also because I have the reflex to correct people if they seem to imply that I am misinformed. Let’s leave it there.

1 Like

Thank you for this!
I would close my PR if you open a PR with it.
However, I have a question: Why is it necessary to mention the Haskell? To understand the Haskell part, the reader should already know the corresponding syntax and semantics in Haskell…

1 Like

It is not necessary. However, it is a good example of a programming language that uses another concept of identity and allows the simulation of how SC works (memory location).

It can be improved to make it more clear. It’s just a draft.

It could be an excellent example for those who are already familiar with Haskell. However, without an explanation of why the Haskell example is presented at this point, it might feel somewhat out of context. Additionally, newcomers might perceive that understanding Haskell is a prerequisite for using SuperCollider. Thus, it might be better to omit the Haskell example in this context.

Nevertheless, I would like to retain this example if it is included outside the SC documentation, or if the section title is “More to Read for Interest: Implementation Comparison of Pure Functional Approach and Pure Object-Oriented Approach” within a note:: or cf:: (compare) context. Unfortunately, sclang does not have cf:: tag.

2 Likes

You are right.

But I think the idea was to have a separate reference with some intermediary and more complete information.

The Haskell code can be removed or not; in any case, I agree there must be a better text there.

1 Like

My suggestion would be to move it to the end and clearly mark it as an addendum, so that it’s clear that it isn’t part of the main line of the argument. When it’s in the middle, it seems like “must-know” but I agree with prko that for most users, it won’t be.

SC documentation could use more side topics like this (while being mindful of the maintenance cost, which should be minimal in this case)! Just as long as they’re clearly marked.

hjh

3 Likes