Integer Overflow in Ethereum

Bio

Mark Yosef - Security Researcher and Data Scientist at Valid Network.

Has an MSc in Information and Software Systems and Engineering from Ben-Gurion University. Specializes in Machine Learning and Big Data, and is extremely passionate about Cybersecurity, Blockchain and Algo-trading. Enjoys collecting Pokémon cards and bobble-heads of rappers

‍

Numbers’ Representation in Computers

Modern computers store data from different types like numbers, text, etc. This article will focus on integer overflow, although other overflows (like buffer overflow) exist. There is an infinite amount of numbers between -∞ and ∞, but our computers have a finite capacity of storage, so there is a limit on how large or small numbers we can store.

In addition, computers don’t understand human languages, so the data is represented in binary, a 2-based number system, which consists of 0’s and 1’s. Sometimes, these values are represented in hexadecimal, which is a 16-based number system, with a ‘0x’ prefix [1].

In many programming languages, every storage slot has a type that defines what kind of information is stored within it, and what the maximum capacity of the specific slot is. In languages like C and Java, these can be represented by multiple types for numbers like ‘int’, ‘short’, ‘double’, etc. An int type declares that the maximum value length that can be stored in a specific slot is 4 Bytes or 32 bits (each Byte consists of 8 bits).

For example:

0x0000000F (Binary: b00…001111) - is a binary/hex representation of the decimal value 15.

Thus, we can deduce that in such languages:

0x00000000 (decimal 0) - is the minimum value of unsigned int.
0xFFFFFFFF (decimal 4294967295) - is the maximum value of unsigned int.

This can be also calculated by 2³² - 1, which is all possible combinations of 0’s and 1’s of length 32, minus 1 because computers start counting from 0.

What is Integer Overflow?

Well, if there is a limit to the size of numbers in computers, what happens when we cross this limit?

In Ethereum, every unsigned int slot in the storage is 32 Bytes or 256 bits. Let’s say you want to perform an arithmetic addition between 2 legitimate but unsigned integers:

A = 0x00…001 (1)
B = 0xFF…FFF (2²⁵⁶ - 1, Max unsigned value)
C = A + B = 0x00…001+ 0xFF…FFF = 1 + (2²⁵⁶ - 1) =
2²⁵⁶ = 0x100…000

What Just Happened?

The result of this arithmetic addition is a number that is greater than the maximum possible integer. It consists of one followed by 256 zeroes (b100…000), which in total has a length of 257 bits. But the slot of the integer value in the storage can only have 256 bits. Therefore, only the 256 RMB (rightmost bits) are stored in the storage, and everything else is ignored.

As a result, the value that is actually stored is 0x00…000 (0), this is an integer overflow.

Ethereum Smart Contracts, EVM & Solidity

There are two types of entities on the Ethereum network:

• User (EOA) - Externally Owned Account which is also called a wallet.

• Smart Contract - an entity that has a code and persistent storage.

When a user or a contract wants to interact with a contract on the Ethereum network, it creates a transaction that invokes some functionality of the called contract code and sends it to the network. Once a potential block miner pulls this transaction from the transaction pool, he executes the transaction using the Ethereum Virtual Machine (EVM) [2].

The miner uses an Ethereum node such as ‘geth’ to interact with the Ethereum network. EVM is a component of the node that is responsible for executing transactions. It starts by getting the contract’s context, the immutable code, and the persistent storage. Then, it executes the called code and stores the changes to the storage. The size of every slot in the EVM data structure (stack/memory/storage) is 32 Bytes or 256 bits.

The code of a contract consists of byte codes, somewhat similar to Java’s JVM and byte codes. These byte codes are complicated for a human to read, so usually they are a product of a compilation process from a high-level programming language like Java, or in our case Solidity.

Solidity [3] is a high-level programming language that can develop smart contracts. Solidity’s compiler compiles the contracts to EVM byte codes that can be deployed to Ethereum’s network. ‍

Integer Overflow in Ethereum

In Solidity, you can perform many different operations with numbers. One such case is arithmetic and a problem associated with it is that an integer overflow can occur in such code.

There are two types of integers in Solidity [4]:

uint: unsigned integers - positive numbers ranging from 0 to (2²⁵⁶ - 1) [UINT_MIN, UINT_MAX].
int: signed integers - both positive and negative numbers ranging from -2²⁵⁵ to (2²⁵⁵ - 1), [INT_MIN, INT_MAX].

In signed int, the LMB (leftmost bit) represents the sign of the number and thus signed int. 0 in the LMB stands for positive numbers, and 1 stands for negative numbers. Therefore, the number of bits in a number value is decreased from 256 to 255.‍

At a first look, one can see that in the unsigned number circle we can have either addition of 2 numbers that overflows to a smaller value, or subtraction of 2 numbers that underflows to a greater value. However, in the signed circle and due to the sign we can have both overflow and underflow within the same operation.

Let’s look at the addition of 2 signed numbers for example:

Detection

In this section, we will present the process and challenges of detecting integer overflow in Ethereum.

1. No Indication

While in other software languages and machine codes there is an indication of arithmetic integer overflow (for example, Overflow flag in Assembly [5]), that is not the case for EVM. There is no indication that an overflow has occurred during an execution of a transaction on the EVM. In some cases, you can deduce that an overflow has occurred from the values that are stored after the execution of the transaction. However, you most probably will have to re-run the transaction and find out overflows using different heuristics.

2. Different Arithmetic Operations

Integer overflow/underflow can occur after the addition or subtraction of 2 numbers. However, because the multiplication operation is based on addition, it can cause overflow as well. The same goes for exponent operation which is based on multiplication. So specifically, to EVM, those are some of the opcodes that can cause an integer overflow: ADD, SUB, MUL, EXP [6].

3. Signed Vs Unsigned Arithmetics

Things get even more complicated when we consider the type of operands. As I have mentioned above, the same hexadecimal value in the storage can be interpreted differently based on the type of slot. For example, 0xff…fff is -1 in signed int, but a MAX_UINT (2²⁵⁶ -1) in unsigned int. Therefore, the detection of integer overflows should be aware of the slot types.

Let’s go through an example:

We can see this behavior clearly with the circle of integers visualization.

Generally, signed integers are more complex and may have more overflow issues than unsigned integers. There is also an arithmetic operation that can cause overflow only in signed numbers. When we have 2 unsigned integers A and B (positive and non-fractions), the result of A / B will always be a positive number smaller than A and B. However, let’s look at the division operation (SDIV) edge case with signed numbers:

While -2²⁵⁵ / -1 should be equal to 2²⁵⁵, in hexadecimal value it is 0x80…000. But, in signed integer type, 0x80…000 represents -2²⁵⁵ [INT_MIN]. So instead of getting a positive number by dividing 2 negative numbers, we get a negative number, which in turn is an overflow. This isn’t possible when we use unsigned numbers division.

4. False Positives

Even if we can identify that an overflow has occurred, we can understand what operation caused it and whether the operands are signed or unsigned integers, sometimes it still doesn’t enough. Sometimes an overflow is desirable behavior. Some compilers create an overflow intentionally to run some functionality, and sometimes even the smart contract’s developers base their coding logic on desirable overflows. Therefore, even when we detect an overflow, we can’t be sure whether it is an unexpected behavior that can be a potential vulnerability or a desirable functionality. Thus, the FP (False Positive) rate of integer overflow detection is high.

5. No Source Code and Types

The types of unsigned and signed integers are declared in the high-level programming language, which for us is Solidity for Ethereum. There are no types on the machine code or byte codes level. Therefore, what happens when there is no Solidity source code for a contract? How can we know whether the addition of 2 numbers is a signed or an unsigned addition, without knowing the types of slots storing those numbers?

Let’s look at the following example:

These two pieces of code are almost identical Solidity codes, except for the types of the parameters ‘a’ and ‘b’. They have been compiled to EVM byte codes using Remix IDE [7]. Unfortunately, we can see that the compiled byte codes of the addition operation are identical, even when the types of the parameters are different. Therefore, we can’t distinguish between them based only on the byte codes.

Using Integer Overflow to Perform an Attack on Ethereum Network

BeautyChain (BEC) contract is a great example of using an integer overflow as a vulnerability to perform an attack on a contract. The attacker used the behavior of integer overflow to overcome some security checks and have stolen a huge amount of BEC tokens. A link to a great blog describing the attack is mentioned in [8].

The BEC contract: https://etherscan.io/address/0xc5d105e63711398af9bbff092d4b6769c82f793d#code
The ‘batchTransfer’ transaction that has performed the attack: https://etherscan.io/tx/0xad89ff16fd1ebe3a0a7cf4ed282302c06626c1af33221ebe0d3a470aba4a660f

Prevention

Luckily, some solutions can prevent integer overflow issues.

SafeMath

SafeMath.sol [9, 10] is a well-known library used in many contracts. It provides the basic arithmetic operations but can also check the preconditions and postconditions to understand whether an overflow has occurred. In case it did, the library fails the execution of the transaction and updates the status of the transaction as ‘Reverted’.

Compiler Version

You can compile your code with a newer compiler version [11]. This way, the preventive code of external libraries like SafeMath is embedded in the compiled code. However, be sure to design your code properly to avoid Denial of Service attacks that are based on integer overflow.

Conclusion

Many involved in blockchain do not fully comprehend the impact of software flaws and how they can enable vulnerability. It is critical to understand how numbers are represented with computers, what are signed and unsigned numbers, and what an integer overflow attack is to understand the full scope of vulnerabilities.

Valid Network focuses on providing a holistic solution to deal with such integer overflow issues to mitigate risk and vulnerability in applications. What can seem like a simple issue can lead to catastrophic consequences in the software operations, as can be seen in the examples above leading potentially to exploitable situations.

Without ensuring coding can mitigate such issues the smart contracts and software operations handling digital assets are inherently at risk, and this is why Valid Network provides a holistic solution for dealing with potential attack vectors and software issues that can occur on the Ethereum network. We believe no matter how small or minor the vulnerability the impact it could have has the very real potential to cause incredible damage.

References

It’s time to Deriskify Crypto!

Uncover risks & opportunities in crypto to maximize your gains.

Valid Data’s real-time and predictive insights are used by Cryptocurrency traders and exchanges, as well as investors and hedge funds, to make better investment and trading decisions, to protect the value of their digital assets, and to capitalize on market opportunities that only Valid Network’s technology can uncover.

Try Valid Data

Other Blogs

Aug 19, 2020

Decentralized Applications: The good, the bad, and why should enterprises care?

With the upsurge of blockchain, the emergence of dApps is already a steady trend. Now is the time for enterprises to pay more attention to what’s happening and how it affects them and their target audience.

Dec 14, 2020

The Risks of Broken Access Control and the Blockchain

As blockchain applications are a form of web application, access control is still a common problem even for blockchain developers. But despite its common use, access control is difficult to implement and manage properly, easily leading to a misconfigured security control that leaves an enterprise’s data at risk.

May 25, 2020

Serverless Over Kubernetes - The Clean Way

Whenever you want to login into your social media account, say Twitter, you are expected to provide a secret passage – a password. The website checks if your password is correct and if it is, you are granted access. This mechanism works because Twitter assumes

Jan 11, 2021

Financially Exploiting the Blockchain with Frontrunning

What if you could make USD 1,000,000 in 30 minutes on the blockchain with some scripts, insider knowledge, and the right timing? Unethical, illegal, and difficult to prevent, this threat to the defi is called frontrunning.

May 25, 2020

Tornado.cash Decentralization applications Issues

Tornado.cash is a recently announced dApp on Ethereum that allows private transactions on the otherwise public Ethereum network. Private transactions have been a much sought after feature on Ethereum, with many projects developing such features.

Oct 1, 2020

An Introduction to the Enterprise Blockchain Landscape

Enterprise blockchains are starting to gain popularity in recent years. After the big hype around digital currencies has slowly subsided, the interest shifted to the technology that formed the basis for these currencies, the blockchain, and the possibilities it holds for organizations.

Nov 26, 2020

The Risks of Injection Attacks on the Blockchain

Injection attacks are one of the most significant risks to any network-connected system. These attacks use malicious data to attack software systems and can be launched against the client-side of an application, but also against the server-side the database, and the smart contracts.

Feb 1, 2021

Introducing Ethereplay by Valid Network

We are excited to announce Ethereplay by Valid Network, a free community tool to support examining, analyzing, optimizing and securing of smart contract code on Ethereum.

Jul 26, 2020

Onboarding blockchain tech? Don’t miss these important facts

Key issues that enterprises must carefully consider and deal with when onboarding blockchain technology

May 25, 2020

The Reentrancy Strikes Again — The Case of Lendf.Me

DeFi or decentralized finance is a growing sector in the blockchain and cryptocurrency space that defines an ecosystem of decentralized applications providing financial services with no governing authority.

Oct 15, 2021

Insights from Mainnet 2021: The Future is Multichain

Multichain networks will reshape the way we think and experience crypto. From the way we navigate DeFi, send funds with ease across different Blockchains, and move our digital avatars without the worry of fees and delays. Multichains will open the door to a world of opportunities, where digital assets will become the norm. Read below to participate in the revolution.

Apr 29, 2021

Top 2020 Blockchain Hacks (and what you can do about them in 2021)

2020 has seen a rise in both impact of attacks and sophistication over previous years. Attackers stole $3.8 billion -- in just 122 attacks. This blog covers the Top Blockchain Hacks of last year. Looking forward towards 2021, companies utilizing blockchain will continue to face these challenges with both known and novel threats in a highly open and visible ecosystem.

Jun 29, 2021

Who’s Afraid of DeFi?

DeFi and cryptocurrency are the next generation of finance, set to disrupt the mainstream capital markets and financial institutions. Why are banks, investors, and regulators afraid of DeFi? What can overcome the fear? Read our new Valid Network blog.

Aug 23, 2021

Blockchain Vulnerability

Blockchain is an exciting and innovative area, and unlike other nascent technologies, blockchain comes with substantive progress in its brief history, with greater exposure to wider audiences. However, there are also bad actors who are attracted to the ecosystem who try and identify potential Vulnerabilities and exploit these in a way that will allow them to generate a financial gain.

Jul 11, 2021

What are CBDC and are Digital Currencies Safe?

Cryptocurrency and DeFi trading platforms have long signified a coming change in the way currency is handled around the world.

Jul 1, 2021

Integer Overflow in Ethereum

Many involved in blockchain do not have a full comprehension of the impact of software flaws and how they can enable vulnerability.

Integer Overflow in Ethereum

Bio

Numbers’ Representation in Computers

What is Integer Overflow?

What Just Happened?

Ethereum Smart Contracts, EVM & Solidity

Integer Overflow in Ethereum

Detection

1. No Indication

2. Different Arithmetic Operations

3. Signed Vs Unsigned Arithmetics

4. False Positives

5. No Source Code and Types

Using Integer Overflow to Perform an Attack on Ethereum Network

Prevention

SafeMath

Compiler Version

Conclusion

References

It’s time to Deriskify Crypto!

Uncover risks & opportunities in crypto to maximize your gains.

Other Blogs

Decentralized Applications: The good, the bad, and why should enterprises care?

The Risks of Broken Access Control and the Blockchain

Serverless Over Kubernetes - The Clean Way

Financially Exploiting the Blockchain with Frontrunning

Tornado.cash Decentralization applications Issues

An Introduction to the Enterprise Blockchain Landscape

The Risks of Injection Attacks on the Blockchain

Introducing Ethereplay by Valid Network

Onboarding blockchain tech? Don’t miss these important facts

The Reentrancy Strikes Again — The Case of Lendf.Me

Insights from Mainnet 2021: The Future is Multichain

Top 2020 Blockchain Hacks (and what you can do about them in 2021)

Who’s Afraid of DeFi?

Blockchain Vulnerability

What are CBDC and are Digital Currencies Safe?

Integer Overflow in Ethereum

Subscribe to our newsletter and get the latest updates every day

Discover

Community

Legal

Contact