What's a Type?
Mark June 09, 2023 [Types] #IntroThis is the first in a series of loosely related posts exploring Types, Type Theory, Type Systems, and some related content from the point of view of a computer scientist. The general idea is to provide a 10000 foot view of what I've been learning on the topic that may inspire me (or others) to remember and learn more.
Preface
Soooooooo... what is a type?
The question this blog post is trying to answer can feel rather nebulous. The problem is that in programming languages, types wear many hats. Function overloading lets you create control flow based on types, deriving mechanisms let you generate code based on types, IDEs provide some of their tooling based on type definitions, and so forth.
My plan here is to dance around the topic a bit. I want to talk about the basic types most programmers are familiar with — for example, integers. I also want to motivate future discussions in other posts about types that are a bit more abstract. For example, types that construct other types, or types that form a mathematical structure called a semiring — which among other things let you use algebra to get nice things for free.
One of the things I've hoped to avoid doing here is to mystify types (I'm not sure whether I've succeeded). While types seem to get at the fundamentals of what abstraction is about, they're also very natural. Assembly programmers have been using types since before we developed syntax for them or automated ways to enforce/check them. What I'm trying to dance around here really shouldn't invalidate anything about the types you are already familiar with. Nobody has lied to you about Booleans or Strings, they continue to be what you've always known.
We're going to start with what I undertand the general pragmatic view of types to be, then I'm going to detour us into some super basic Propositional Logic and hopefully come full circle into a very vague understanding of types and how we might change our view on what it means to annotate a program with them.
The Pragmatic View of Types in Computer Science
[A] computer is unable to discriminate between for example a memory address and an instruction code, or between a character, an integer, or a floating-point number, because it makes no intrinsic distinction between any of the possible values that a sequence of bits might mean
[...]
Assigning a data type, termed typing, gives meaning to a sequence of bits such as a value in memory
Consider the following sequence of bits:
10000001
What could this sequence of bits represent as a value?
- If that's an unsigned 8bit integer, then it could represent a
129
. - It also could represent any constant transformation of an 8bit integer:
- Absolute negative value:
-129
- Thousands:
129,000
- Even numbers:
258
- Absolute negative value:
- If that's a signed 8bit integer:
- It could represent a
-1
- If it is in 2s complement then
-127
- It could represent a
- If the bit that carries the sign isn't the standard one, there's a whole other set of numbers this could represent.
- It could be an array of two unsigned 4bit integers:
[8,1]
- The value may not be an integer at all. Getting more outlandish, it might be a bit-map for items in an adventurer's backpack. In which case this might encode two specific key items (item-id
0
and7
) out of eight key items an adventurer will need to find during their quest! - This value could be a memory address on a very small embedded system, or perhaps the system pads this with zeros to interpret it as a memory address.
- This could be a machine instruction code of some sort.
- This could be a path through a binary tree.
- The above considerations also depend on the "endianness" of the memory layout as well.
You can see the problem. You cannot tell the type of this value by inspecting its bits. Giving those bits a type is the process by which we imbue them with meaning. It doesn't matter that assembly has no means to annotate types and no automated way to check them, any programmer using assembly must do that work manually — they can not reasonably choose to avoid it. For example: it's never desirable to multiply two memory addresses together, so an assembly programmer does their best to avoid doing so. Failing in this task isn't a type error within the language, but it is still a type error conceptually and will result in an incorrect program.
Programmers need to reason about the types of the data they are working with in order to ensure that their programs are correct.
A more abstract view of types
The pragmatic view of types really bundles up the idea of capturing how data is represented. This is not typically how types act while programming. Consider how often a developer even sees the sequence of bits that represent the values they use. Especially in the basic nominal cases, developers leave the bit-fiddling to low-level operations. It really doeasn't matter if a -5
looks like 1000 0101
or 1111 1011
, so long as the operations (like addition and so on) work as expected.
In general, programmers consider even the most fundamental types to have an existential representation. A well behaved representation exists (because it must), but any other well behaved representation would do just as well. The type system forms an abstraction boundary so that so long as the API of an int doesn't change, the representation is largely irrelevant [1].
Instead of "giving meaning" to a sequence of bits, types can be seen in some ways as the opposite. They describe a program that's provably generic over the representation, which means a Java Class doesn't describe a program, but instead a whole set of programs (at least one program for each possible representation of int). It just so happens we pick binary representations that work well with our hardware.
In some languages there are pragmatic reasons (say performance/memory) to care about the representation or some part of the representation. Because of its pedigree as a systems language, Rust has 12 different integer types, though they differ on whether they are signed (support negative integers) and how much memory they require, they don't differ on any other representational decisions such as which bit is the most significant, which bit is signed, or whether signed numbers are in two's complement. Even here, the types act to abstract exactly those choices developers want to stop caring about.
Type Systems share some Similarities with Proof Systems
So if it's not really about giving sequences of bits meaning, is there a different way to look at types?
While tackling this topic, I'm hoping that you have some (perhaps partially forgotten) introductory-level knowledge of formal logic. If you’re like me, you may have encountered some propositional logic in your Grade 11 Discrete Mathematics course, or perhaps a few years later in undergraduate philosophy. I'm not going to lean on it in any in-depth way in this blog, but its a nice building block for understanding type systems and in turn, types.
While I don't want to turn this is an introduction to propositional logic, lets take a quick moment to refresh our memories on the two ways you can show that A ∧ B
entails B ∧ A
in propositional logic.
Semantic Entailment
The first way you can show that A ∧ B
entails B ∧ A
in propositional logic is via a semantic proof. You've probably seen these done via tables (often called "truth tables") like so:
A | B | A ∧ B | B ∧ A |
---|---|---|---|
T | T | T | T |
T | F | F | F |
F | T | F | F |
F | F | F | F |
Implicit in this table is the meaning of A
and the meaning of B
. To make this explicit, they denote something called a proposition. Semantically, propositions are these objects that can be either T
or F
(Sometimes read as "true" and "false" respectively). T
and F
don't appear anywhere when we discuss the syntax of propositional logic, they only appear when we ask something like "what does A
mean?"
We say that A ∧ B ⊨ B ∧ A
because when we ask what A ∧ B
, and B ∧ A
means, in every instance where A ∧ B
is T
, B ∧ A
is also T
(More generally, we can understand ⊨
here as a relationship between specific rows and columns on a truth table). This is pretty easy in this case because we can exhaustively look at every possibility: All four of them!
Maybe you think the answer to a question like "what does A
mean?" should be something like "A stands for 'grass is green'". In philosophy, it's common to pick linguistic propositions that we can interpret some truth value for instead of picking T
or F
directly. Either 'it is raining' or not. Because our interpretations are blind to the difference between A
and 'grass is green', we're avoiding linguistic tricks like A
stands for "this proposition is false" or A
stands for "Not A
" by dint of interpreting only symbols and logical connectives.
So it turns out that in propositional logic specifically, you can always discover whether A ⊨ B
by building a truth table. Building these tables quickly gets tiresome and other logics don’t can’t be exhaustively searched this way, so there’s another way to show this entailment.
Proof System Entailment
The second way you can prove that that A ∧ B
entails B ∧ A
in propositional logic is via its proof system.
I'm not going to go over proof rules. I'm assuming that even if you've forgotten the specifics, you remember generally how they were used. Instead, I'll ask you to notice that we don't contend with T
or F
at all while using a proof system. This is because the end-game of a proof system is to cleverly pick rules that only let you assume something to denote T
or to write something that must denote T
. If you somehow convincingly manage the feat of picking your rules this way, you can magically know what your truth table looks like for every relevant line.
Line # | Proof Line | Proof rule that makes this line legal | 1. | A ∧ B | Assumption | 2. | A | ∧ elimination on line 1 | 3. | B | ∧ elimination on line 1 | 4. | B ∧ A | ∧ introduction on line 3 and line 2 |
---|
We say that A ∧ B ⊢ B ∧ A
because we have a proof system where if we assume A ∧ B
, there exists some number of legal lines after which we can legally write B ∧ A
. We can understand ⊢
as a purely syntactic game because we never need to reference T
, or F
, or tables or any such thing. This becomes an interesting game to us because we've designed it so that it can only generate propositions for which ⊨
also holds. We have a term for this relationship. Having this property where A ⊢ B
entails A ⊨ B
makes a proof system sound. (Perhaps less interestingly; for propositional logic it is also the case that A ⊨ B
entails A ⊢ B
, which makes propositional logic’s proof system complete).
Propositional Proofs in General
In propositional logic, the proof system approach is often a much more concise way to show an entailment. For example, A ∧ B ∧ C ∧ D ∧ E ⊨ B ∧ C ∧ D ∧ E ∧ A
takes a 32-line truth table, while A ∧ B ∧ C ∧ D ∧ E ⊢ B ∧ C ∧ D ∧ E ∧ A
takes a 4-line proof (It’s a twin to the 4-line proof from above where B ∧ C ∧ D ∧ E
replaces B
).
The other cool property of a proof system is that as long as we agree to the same rules, anybody can check a proof and confirm that the rules have been followed. The idea that "there's no special intuition required, anybody can mechanistically check the rules" is what ends up making proofs written in a proof system machine checkable. You can write a program that can check the correctness of any proof.
So even in logics where you cannot "write out the truth table," and for which there's no way to know when to stop searching for a proof... at least you can rest assured that if you ever do find a proof, you and others can certainly verify that the proof is correct.
Type Systems
So what does all this propositional logic have to do with Type Systems and with Types?
Well, type systems and proof systems have a lot of similar properties. They both have a clean separation of syntax and semantics, they both offer a purely syntactic set of rules, that anybody (even a program) can check. They both value the relationship between ⊢
and ⊨
so that the rules for what you can do syntactically translate well into the semantics (Few programming languages actually boast a properly sound type system, though they do still value making the distinction as clear as possible).
Where the proof system above is just there to enforce tautologies, type systems exist to enforce abstractions. Specifically to do so in a machine tractable way. What does that even mean?
It sort of hard to go into detail without picking a specific type system, but we're going to pivit first. Lets get back to propositions so we can talk about about Types themselves.
What are Propositions in Propositional Logic?
We're going to keep coming back to the mathematical substrate I was familiar with before types. It's how I started to build intuitions on the topic, but more importantly for this blog the idea that types and propositions are related is a deeply researched topic. We'll benefit from scratching the surface a bit. I'd like to take a quick detour into what a proposition is because talking about the meaning of things which themselves might denote meaning is more difficult than it first appears.
So when I open my undergrad textbook, the definition of a proposition in propositional logic is as follows:
A proposition is a statement that can be either true or false; it must be one or the other, and it cannot be both.
To help buttress this definition, I may give some examples of statements which are or are not propositions.
The following are propositions:
- The reactor is on.
- The wing-flaps are up.
- John Major is prime minister.
whereas the following are not:
- Are you going out somewhere?
- 2 + 3
This definition leans on a lot of other definitions and intuitions that don't actually surface in propositional logic at all. For starters, you'd be excused for thinking propositions are linguistic expressions, but philosophically the English sentence "Snow is white" denotes the same proposition as the German sentence "Schnee ist weiß" even though the two sentences are not the same. A more nuanced definition may say something about propositions being "the sort of thing that declarative sentences denote".
This is all rather abstract and you can write a PhD Thesis on how linguistic expressions do/have embedded and deviate from formal logic. That sends us down the wrong rabbit whole for our purposes. Really, propositions in propositional logic are the building blocks of a sort of syntactic game that gets interesting and applicable once you imbue it with meaning.
Syntactically, there are a set of base propositions. For example the elements of {P, Q, R}
are propositions because we've defined them to be. Then there are a small set of logical connectives and rules for how they're to be used to juxtapose propositions. So the elements of {¬, ∧, ∨, →, ↔}
are logical connectives. I won't spell out the rules for how to determine a well-formed formula. For example: P → Q
, P → ¬Q
, and P ∨ ¬P
are all well formed but P Q ∧
, P ∧ ∧ Q
, and ¬∧P
are all not well formed. Then we perform the neat trick of saying that any well formed formula is definitionally a proposition.
So assuming you learned the rules for a well-formed formula and you're armed with a set potentially infinite base propositions, you can simply say that propositional logic propositions are exactly these things. In a sense you can stop here. There are infinitely many of them, but you can create and verify every possible proposition and that's the end of it. You've answered the question: "What are Propositions in Propositional Logic?" as best as it can be answered.
What comes next, the "giving them meaning" part can be done in any number of ways. Most of the semantics we could give this system serve no purpose. For example, I can get outright silly with it and say something like "all propositions denote my left pinky finger". I make the syntactic game above somewhat silly because no matter how I puzzle together big complex propositions, all they'll ever do is denote my left pinky finger. That's not helpful.
So instead, let me quote Wikipedia:
propositions are often modeled as functions which map a possible world to a truth value. For instance, the proposition that the sky is blue can be modeled as a function which would return the truth value T if given the actual world as input, but would return F if given some alternate world where the sky is green.
Such functions deterministically tell us how to interpret any proposition as T or F. If they interpret the logical connectives properly, they imbue the entire enterprise with a structured meaning that is interesting by design.
Two quick notes on the side:
-
If you view T as the singleton set and F as the empty set, you can see the standard set theoretic semantics of propositional logic as a generalization of this semantic interpretation of predicate logic. So you can define the logical connectives above using truth tables (in the style I used earlier) or you can define them as set operations like unions, intersections and such. The approaches are the same (up to isomorphism).
-
I'm writting a post which models the semantics of propositions in Lambda Calculus using JavaScript you can easily run in your browser. I'll link it here. It's a completely useless excersize for day-to-day programming, but it completely amazed me and I hope it does the same for you.
About Types
Types are a syntactic tool for abstraction and can be talked about purely as a way to add structure to the question of whether some piece of syntax is well formed.
Typically a type theory will have a few base types and then a few type-forming rules. Then types are straightforwardly defined as the (potentially infinite) hierarchy of types that can be generated. This is defining types by just looking at the syntactic game that creates them — seem familiar?.
So really, what's the point of the syntactic type game loosely described above? Types as they're discussed in the more advanced type theory circles have a direct categorical semantics. As friend of mine jokingly says, "category theory is general abstract nonsense" (based on how prominent representatives of the mathematical community reacted to its introduction), so we're going to shy away from that for the moment.
A way to model the semantics of types of a simple type theory is as sets of values. To quote:
"Types” denote nonempty sets of values; they are used to restrict the scope of variables, control the formation of expressions, and classify expressions by their values.
- William M. Farmer (link)
When you open a Java Class file and see:
int a ;
or a Rust file and see:
let a: i32 = 5;
int and i32 are types and in this case and both denote a 32-bit signed two's complement integer. The variable a
is restricted in scope to exactly one of int's 232 (or 4294967296) possible inhabitants. There's some loose notion under which a
denotes an element of int as though int is a set. If I had written int a = 5.3,
I would have received a compilation error saying the expression is not well typed. The error would say something to the effect there's no meaningful way to assign a floating point number to a variable meant to hold an integer.
Type systems and type theories use types as a syntactic tool for abstraction, but they don't need to be so embedded to be useful. Consider Hungarian notation, which is an identifier naming convention that prefaces variable names with their data type. For example, the b prefacing bBusy denotes a boolean and nSize is an integer. This was popularized in a language called BCPL because BCPL has no data types other than the machine word, nothing in the language itself helps a programmer remember variables' types.
Other untyped languages hold such information in the comments or through other conventions. It's important to note that just like assembly or BCPL still have types in the margins, typed languages also continue that trend. Developers can and do use the type integer as a bit-field or the type string as tags to approximate a disjoint union. Just consider that you can write a Haskell program that only uses values of type SequenceOfBits (allowing a wrapper around the std library) to create most any program you'd like, though I imagine you'd have a hard time doing so without making many many many mistakes along the way.
At their core, types seem like a very simple, but frustratingly abstract thing. They have a somewhat liminal nature, existing in many forms directly in syntax — where rules inductively define well typed syntax — and also in the meta-language — where we leave it as an exercise to the reader to understand why some expressible syntax is not necessarily meaningful.
For example, Rust's type system infers every variable below to be an i32. If I were writing a chair-based library for millions of developers to use, I would probably employ Rust's type system to ensure such code never compiles. If I'm writing a quick script, I may just stick with i32 and be careful about what I write.
let chair_colour = 15761536; // #F08080 HTML standard Hex colour code
let chair_height = 458; // millimetres
// The following is almost certainly a bug
let syntactically_legal_nonsense = chair_colour + chair_height;
A More Nuanced View on Types
To develop a nuanced view of what types you need to understand them in the context of how and why they're used. What motivates the use of types? When we look at the ways in which they're used in older imperative programming languages, a simple answer might be "so that the compiler can calculate how much memory some data takes and how to represent (persist) the data in memory so that I don't have to do it myself". To support this line of reasoning, these languages always give you a way to build more complex types from simpler types. Lets run with an example, most languages allow you to define a new product type called a pair from any other two types: ∀ A:type B:type | (A, B):type
. This reads "The pairing of any two types is itself a type". A use for this is that the compiler can understand how a type can be represented based on its construction. How a pair is persisted in memory depends on the types in question, but you can describe what to do generically. For each way you can build up more complex types (like ∀ A:type | [A]:type
, or ∀ A:type | Vec<A>:type
), you can equip the compiler with rules that let it understand what to do. So the compilor contains something akin to a function from types to memory layout, but just like you can create new proposition using the logical connectives, you can create new types using some given connectives.
It turns out that we want the rules for how to persist a type in memory to have some predictable mathimatical structure. Like if we want to let the compilor optimise memory layouts, we need to feed it rules for how types can be re-written. Why should we allow the following re-write:
Point {
x: Int,
y: Int
}
GameNode {
tag: String,
parent: Point,
pos: Point
}
to
GameNode {
tag: String,
parent_x: Int,
parent_y: Int,
pos_x: Int,
pos_y: Int
}
or even
GameNode (String, Int, Int, Int, Int)
There's a deeper structure here, because just like predicates and propositions become useful in the context of a logic that declares how they can be juxtaposed with various operators (syntax) and a model for how to evaluate any legal syntax to it's meaning (semantics). Types become useful in the context of a theory that declares how they can be juxtaposed with various operators (syntax) and a model for how to evaluate any legal syntax to it's meaning (semantics). You're abstracting something more interesting than "memory layout" when you build types and let the compiler figure out the rest. You're actually annotating your program with a system that has a structure you can hijack for your own means to enforce program-wide properties.
I don't think the following is necessary to develop a deeper understanding of types, but if you've taken a course or done some reading on first-order logic, then I recommend perusing this paper (link). It's a paper that hopes to recommend teaching a simplified type theory to undergrads. Assuming you have a bit of background with first-order logic, the paper is very accessible. A highlight is where it shows that although simple type theory has none of the propositional connectives or quantifiers, they can all be easily defined in simple type theory using function application, function abstraction, and equality. I'll let the paper speak for itself, but to me the power of Simple Type Theory is that is has a very uniform syntax, but hidden within a hierarchy of types is the full power of higher-order predicate logic.
The theory presented in the paper is type theory boiled down to its essence. It is convenient for study, but it is not highly practical for use. This is where extensions to the theory and macro-expansions and such pick up the slack.
In Conclusion
The thing to consider then, is that when a computer scientist is using Int to label a variable, they can be seen as abstracting away the representation of an Int's value as a series of bits, or as annotating their program with something akin to a logic. What's been done when annotating a variable with an Int is very simple, but as you build complex types from simpler types and construct tuples, lists, sets, etc you're building up more complex logical expressions that let you abstract away the representation of increasingly complex data. You can use function abstraction to view programs themselves as sequences of bits that need abstracting. This lets a developer create boundaries across which the way in which entire parts of your program are represented is abstracted.
In functional programming languages with relatively expressive type systems, it's very common to see the sorts of choices made in imperative languages being encoded as types instead. That way instead of writing out if, else if, else
statements, you generate a value of some type and by cleverly encoding how these types are allowed to be composed, you're on you way to ensuring certain bad sequences of choices (now types) are not possible to create without a type error. Carefully crafting and letting a type system enforce such invariants for you matters more as software size, number of developers, and magnitude of maintenance increases.
At a minimum you're adding a logic system that sits on-top of your program and describes some part of it (for example: Java, C#, TypeScript, Go, etc). At the extreme end, you've realized the depths of the Curry–Howard Correspondence and you're a developer writing executable math proofs (for example: Idris, Agda, Coq, Lean, etc).