nach oben

Erschienen in:

Open Access 2020 | OriginalPaper | Buchkapitel

The Move Prover

verfasst von : Jingyi Emma Zhong, Kevin Cheang, Shaz Qadeer, Wolfgang Grieskamp, Sam Blackshear, Junkil Park, Yoni Zohar, Clark Barrett, David L. Dill

Erschienen in: Computer Aided Verification

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Patentsuche

Aus

Abstract

The Libra blockchain is designed to store billions of dollars in assets, so the security of code that executes transactions is important. The Libra blockchain has a new language for implementing transactions, called “Move.” This paper describes the Move Prover, an automatic formal verification system for Move. We overview the unique features of the Move language and then describe the architecture of the Prover, including the language for formal specification and the translation to the Boogie intermediate verification language .

This work was supported by the Stanford Center for Blockchain Research and Novi, a Facebook subsidiary whose goal is to provide financial services that let people participate in the Libra network. The Libra Association manages the Libra network and is an independent, not-for-profit membership organization, headquartered in Geneva, Switzerland.

1 Introduction

The ability to implement arbitrary transactions on a blockchain via so-called smart contracts has led to an explosion in innovative services in systems such as Ethereum [41]. Unfortunately, bugs in smart contracts have led to massive amounts of funds being stolen or made inaccessible [5, 15]. In retrospect, the source of these disasters is fairly obvious: smart contracts operate without a safety net. A fundamental requirement for blockchains is that transactions be automatic and irreversible. Unlike traditional financial applications, there is little opportunity for humans to oversee or intervene in transactions. Indeed, the design of the blockchain is intended to prevent human involvement. The resulting potential havoc that can be caused by a bug in a smart contract makes it essential for these contracts to be correct, without vulnerabilities. Not surprisingly, there is great interest in formal verification and other advanced testing methods for smart contracts, and several verification systems already exist or are under development.

The Libra blockchain [3, 38] is designed to be a foundation for supporting financial services for billions of people around the world. If successful, it could store and manage assets worth billions of dollars, with correspondingly stringent security requirements. The code that modifies the state of the blockchain is especially important. The architecture of the Libra blockchain requires that all such modifications be performed by the Move [12] virtual machine, which executes the well-defined Move instruction set. This architecture means that verification efforts can focus on the correctness of bytecode programs implementing smart contracts, including formally verifying those programs.

Contributions

In this paper, we describe a specification language and formal verification system for Move. If a programmer writes functional correctness properties for a procedure, the Move Prover tool can automatically verify it. Although many similar Floyd-Hoare verifiers exist, widespread adoption has been a challenge because conventional software is large, complex, and uses language features that present difficulties for even the simplest verification tasks. However, we are hopeful that the Move Prover will be used by the majority of Move programmers. There are three reasons for this optimism. First, the Move language has been designed to support verification. Second, we are building a culture of specification from the beginning: each Move module used by the Libra blockchain is being written with an accompanying formal specification. Finally, we are working to make the Move Prover as precise, fast, and user-friendly as possible.

The Move language, the Move Prover, Move programs, and their specifications, have been evolving rapidly, so this description necessarily represents a snapshot of the project at a particular time. However, we expect most of the changes to be improvements and extensions to the basics described here. In the remainder of this paper, we will:

Present a brief overview of Move and explain the language design decisions that facilitate verification (Sect. 2);

Describe how the Move Prover toolchain is implemented (Sect. 3);

Explain the model used to represent Move programs (Sect. 4);

Define the Move specification language and give examples of useful properties it can encode (Sect. 5); and

Demonstrate that the Move Prover can verify important aspects of the Libra core modules (Sect. 6).

2 Background: The Move Language

Move [12] is an executable bytecode language for writing smart contracts and custom transaction logic. Contracts in Move are written as modules that contain record types and procedures. Records in modules may either be struct or resource types—the most novel feature of Move. A resource type has linear [17] semantics, meaning that resources cannot be created, copied, or destroyed except by procedures in its declaring module. Resources allow programmers to encode safe, yet customizable assets that cannot be accidentally (or intentionally) copied or destroyed by code outside the module.

Move is minimal in comparison to most conventional programming languages. The only types besides records are primitives (Booleans, unsigned integers, addresses), vectors, and references (which must be labeled as mutable or immutable, similar to Rust [30]). Records can contain primitives and other records, but not references. Control-flow constructs can be encoded via jumps to static labels in the bytecode.

Move programs execute in the context of a blockchain with modules and resources published under account addresses. To interact with the blockchain, a programmer can write a Move transaction script, a single-procedure program similar to a

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figb_HTML.gif

procedure in a conventional language, that invokes procedures of published modules. This script is then packaged into a cryptographically signed transaction that is executed by validators in the Libra blockchain. As in Ethereum, transaction execution is metered, meaning that computational resources (or “gas”) used when a Move program is executed are measured and must be paid for by the submitter of a transaction (though we note that the Move Prover does not yet reason about gas usage).

Verification-Friendly Design. There are several aspects of Move ’s design that facilitate verification. The first is limited interaction with the environment: to ensure deterministic execution, the language can only read data from the global blockchain state or the current transaction (no file or network I/O). Second, many features that are challenging for verification are absent from Move: concurrency, higher-order functions, exceptions, sub-typing, and dynamic dispatch. The absence of the last feature is particularly notable because it is present in Ethereum bytecode and has contributed to subtle re-entrancy bugs (e.g., [14]). Third, Move has built-in safe arithmetic: overflows and underflows are detected during execution and result in a transaction abort. Finally, many common errors are prevented by the Move bytecode verifier (not to be confused with the Move Prover), a static analyzer that checks each bytecode program before execution (similar to the JVM [26] or CLR [31] bytecode verifier). The bytecode verifier ensures that:

Procedures and struct declarations are well-typed (e.g., linearity of resources)

Dependent modules and procedure targets exist (i.e., static linking)

Module dependencies are acyclic

The operand stack height is the same at the beginning and end of each basic block

A procedure can only touch stack locations belonging to callers via a reference passed to the callee

The global and local memory are always tree-shaped

There are no dangling references

A mutable reference has exclusive access to its referent

Because these checks are run on every Move bytecode program, the prover can rely on them in its own reasoning. Note that this would not be true if the checks were performed by a source language compiler, since bad bytecode programs could be created by compiler bugs or by writing programs directly in the executable bytecode representation.

Limited Aliasing. In the rest of this section, we present an example that explains the memory-related invariants enforced by the Move bytecode verifier (6–8 above). The example in Fig. 1 is written in the Move source language, which can be directly compiled to the Move bytecode representation shown in the comments (note that the Move Prover analyzes the bytecode itself). The

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figc_HTML.gif

procedure accepts two arguments:

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figd_HTML.gif

of type

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Fige_HTML.gif

(a mutable reference to a

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figf_HTML.gif

value stored elsewhere) and

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figg_HTML.gif

of type

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figh_HTML.gif

(an owned

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figi_HTML.gif

value). The purpose of this procedure is to destroy the

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figj_HTML.gif

resource stored in

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figk_HTML.gif

and add its value to the

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figl_HTML.gif

resource referenced by

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figm_HTML.gif

. The first line of the procedure performs the destruction by “unpacking”

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Fign_HTML.gif

(placing the program value bound to its field into the program variable

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figo_HTML.gif

), and the next two lines read the current value of

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figp_HTML.gif

and update it.

The careful reader might wonder: what will happen if

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figq_HTML.gif

is a reference to

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figr_HTML.gif

? In a C-like language, the first line would make

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figs_HTML.gif

into a dangling reference, which would lead to a memory error when it is subsequently used. Fortunately, the Move bytecode verifier ensures that this cannot happen. An owned value like

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figt_HTML.gif

can only be moved (either onto the operand stack or into global storage) if there are no outstanding references to the value. In addition, the bytecode verifier guarantees that no mutable reference can be an ancestor or descendant of another (mutable or immutable) reference in the same local or global data tree. This is a very strong restriction! It ensures that procedure formals that can be mutated (mutable references or owned values) point to disjoint memory locations. For example, an additional formal of type

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figu_HTML.gif

in the code above could not point into the memory of the other formals. Formals that are immutable references may alias with each other, but not with mutable references or owned values. This means it is impossible for an update to a reference to affect the value retrieved by a simultaneously existing reference. These restrictions on the structure of memory enable greatly simplified reasoning about aliased mutable data, a significant challenge for verification in conventional languages.

3 Tool Overview

Figure 2 shows the architecture of the Move Prover. The prover takes as input Move source code annotated with specifications. The overall workflow consists of several steps. First, the specifications are extracted from the annotated code, and the Move source code is compiled into Move bytecode. Next, all stack operations are removed from the bytecode and replaced with operations on local variables, and the stackless bytecode is abstracted into a prover object model. Along a separate path, the specifications are parsed and added to the prover object model. The finalized model is translated to a program in the Boogie intermediate verification language (IVL) [23, 24].

The Boogie program is handed to the Boogie verification system, which generates an SMT formula in the SMT-LIB format [10]. This can then be checked using an SMT solver such as Z3 [32] or CVC4 [9]. If the result of this check is UNSAT, then the specification holds, which is reported to the user. Otherwise, a countermodel is obtained from the SMT-solver, which gets translated back to Boogie. Boogie produces a Boogie-level error report, and this result is analyzed and transformed into a source-level diagnosis that is given back to the user. Using this diagnosis, the user can refine the implementation and/or specification and start the process again.

The prover is written in Rust and can be found in the

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figv_HTML.gif

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figw_HTML.gif

directory in the Libra repository on GitHub [25].¹ We describe the Boogie model and the specification language in more detail in the following sections.

4 Boogie Model

Boogie IVL is a simple imperative programming language that supports local and global variables, branching and loops, and procedures and procedure calls. Boogie is designed for verification, so it also supports pre- and post-conditions, loop invariants, and global axioms. Boogie programs are not executable; instead, they are provided as input to the Boogie verification system, which applies a verification strategy to generate verification conditions (as SMT formulas) [8]. If all of the verification conditions hold, then each procedure ensures its post-conditions, under the assumption that its pre-conditions hold. The variable types supported by Boogie IVL match the sorts supported by SMT solvers, e.g., Booleans, integers, arrays, bitvectors, and datatypes. This makes the translation of Boogie verification conditions into SMT formulas fairly transparent. Boogie is used as a back-end for a wide variety of verification tools. The general strategy is to model the semantics of a source language in Boogie. Then, programs and specifications in the source language can be translated into Boogie IVL and checked using the Boogie verification system. For more details about Boogie, we refer the reader to [1, 7, 23, 24].

Following this pattern, we built a Boogie model for Move bytecode programs. A few highlights of the model are shown in Fig. 3 and described below. For a detailed understanding of the model, we refer the reader to the full Boogie model, which can be found in the Libra repository at

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figx_HTML.gif

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figy_HTML.gif

and to a formalization of the core Move bytecode language described in [13].

As mentioned above, in Move, a data value is either a primitive value (e.g., Boolean, integer, address), a struct (i.e. a record) containing one or more data values, or a vector of data values. Data values are represented in Boogie as the

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figz_HTML.gif

datatype, with one constructor for each primitive type, plus a vector constructor (containing one field: a finite array of

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figaa_HTML.gif

), used to model both vectors and structs.

Because Move supports generic functions (i.e. type-parameterized functions), we define a similar Boogie datatype for types called

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figab_HTML.gif

(not shown). A type-parameterized function can then be represented as a Boogie procedure whose initial arguments are of type

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figac_HTML.gif

(for the type parameters) and whose data arguments are of type

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figad_HTML.gif

(regardless of their actual Move type). The bytecode verifier ensures type-correctness, so we do not check that types are used correctly, but rather assume this is the case (by using Boogie

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figae_HTML.gif

statements as needed).

The

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figak_HTML.gif

and

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figal_HTML.gif

datatypes are mutually recursive, and thus a

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figam_HTML.gif

can be thought of as a finite tree. A primitive

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figan_HTML.gif

is a leaf node of the tree, while a struct or vector

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figao_HTML.gif

is an internal node. A position within the tree can be uniquely identified by a path, which is a sequence of integers. A path specifies a node of the tree by starting at the root node and then following children according to the indices in the path. We model paths as finite arrays (also shown in Fig. 3). This simplifies the specification that two trees are disjoint, which is a necessary precondition in some smart contract functions.

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figap_HTML.gif

can be stored in either local or global state, and references to data in either are allowed as local variables. For simplicity and uniformity, we have a single memory object which is a map from

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figaq_HTML.gif

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figar_HTML.gif

(because memory is a partial function, it also contains a map from

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figas_HTML.gif

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figat_HTML.gif

, which indicates whether a particular location is present in memory). A

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figau_HTML.gif

is either global (indexed by an account address and a type) or local (indexed by an integer). References are then represented as a pair consisting of a location and a path. To model reading from or writing to a reference, the global memory is accessed along the reference’s path. Note that this is done by enumerating cases up to the maximum possible path depth (based on the data structures in the modules being verified).²

Finally, each bytecode instruction is modeled as a procedure modifying local or global state in Boogie. A bytecode program is then translated to a sequence of procedure calls, with

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figav_HTML.gif

statements handling control-flow.

5 Specifications

The Move Prover has a basic specification language for individual functions. Specifications include classical Floyd-Hoare pre-conditions, post-conditions, and a new condition specifying when a function aborts. (We are expanding this functionality to include ghost variables and global invariants for modules.) These conditions are separated from the actual code, in “spec blocks,” which are linked by name to the structure or function being specified, or to the containing module. Specifications never affect the execution of a module. A simplified example based on verifying a real Libra module appears in Fig. 4.

Pre-conditions and post-conditions are standard. Pre-conditions are introduced by the reserved word

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figaz_HTML.gif

and post-conditions are introduced by

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figba_HTML.gif

, and each is followed by a Boolean expression, in a syntax that is very similar to Move, which includes the usual relational and arithmetic operators, record field access, etc. A sub-expression after

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figbb_HTML.gif

can be enclosed in

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figbc_HTML.gif

, causing the expression to be evaluated using the variable values in the program state immediately after entry to the function, instead of using the program state just before exit from the function. Move functions can return multiple values, so the expressions

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figbd_HTML.gif

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figbe_HTML.gif

, etc. represent those return values.

Formal verifiers for conventional programming languages treat run-time errors as bugs to be reported. However, as in most smart contract languages, performing an undefined operation in Move, such as division by zero, cancels the entire transaction with no effect on the state except the consumption of some currency to pay for the computational resources consumed by the code that was executed before the error occurred. In Libra, this event is called an abort. Aborts are not necessarily run-time errors in Move. They are the standard way to handle illegal transactions, such as trying to perform an operation that is not authorized by the sender of the transaction.

Instead of treating all possible abort conditions as bugs, the Move Prover allows the user to specify the conditions under which a function is expected to abort. This type of specification is introduced by the reserved word

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figbf_HTML.gif

, which is followed by the same kind of expressions that can appear after

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figbg_HTML.gif

. When

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figbh_HTML.gif

P appears in the specification of a function, the Move Prover requires that the function aborts if and only if P holds. If multiple

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figbi_HTML.gif

conditions are specified, there is an error unless the function aborts if and only if the disjunction of all their conditions holds. (This current semantics of

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figbj_HTML.gif

is subject to change.)

There are two expressions that are specific to the Libra blockchain. The expression

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figbk_HTML.gif

is true iff there is an instance of the type T from module M appearing under account A in the global state tree. In the example of Fig. 4, the first post-condition asserts that the payee account exists after a payment transaction (the payee account might not exist before the payment, in which case it is created). The expression

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figbl_HTML.gif

represents the value of type T from module M stored at account A. In the example, this construct accesses the balance values of the sender (the payer), to make sure that the balance covers the payment, and to assert that the payer account balance has decreased by the payment amount if the payment is successful.

Specification Translation. Specifications are translated into

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figbm_HTML.gif

and

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figbn_HTML.gif

statements in Boogie and combined with the prelude (the Boogie model, see Sect. 4) and the translated Move bytecode for the program.

A global Boolean variable

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figbo_HTML.gif

is introduced and assumed to be

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figbp_HTML.gif

at the beginning of each procedure. The Boogie code for each instruction sets this flag to

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figbq_HTML.gif

for conditions that cause abort, such as undefined operations or failures of explicit Move

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figbr_HTML.gif

statements.

The specification translator combines, using logical disjunction, the conditions of all

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figbs_HTML.gif

statements into a single expression (called

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figbt_HTML.gif

here), which is translated into the Boogie specifications

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figbu_HTML.gif

and

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figbv_HTML.gif

6 Evaluation

In this section, we report on our experience using the Move Prover. We first demonstrate that it can successfully be used on core modules in the Libra codebase.

Verifying Core Modules. We wrote specifications for all of the functions (25/25) in the Libra module and most of the functions (34/38) in the LibraAccount module (4 functions use features that are not yet supported: non-linear arithmetic and referencing data in the spec that does not appear in the code).³ These are core modules of the Libra system, and their correct execution is crucial. The Move Prover was able to prove all of these specifications in under a minute, as shown below. The modules with their specifications are available in the Move Prover source tree.⁴ The Libra and LibraAccount modules comprise nearly 1300 lines (including specifications). The total size of the generated Boogie files is a little over 14,000 lines, and the generated SMT files are around 52,000 lines. Writing these specifications was quite natural, thanks to the tree-based memory model and to the support for type-generics. Experiments were run on a machine with an Intel Core i9 processor with 8 cores @2.4 GHz and 32 GB RAM, running macOS Catalina.

Move Module	LoC	Boogie LoC	SMT LoC	Functions	Verified	Runtime
Libra	420	3875	11,688	25	25	2.99 s
LibraAccount	867	10,362	40293	38	34	46.66 s

Impact of Move Prover. The Move Prover is co-developed with the Move language itself (which is relatively stable) to ensure that contracts remain correct as the entire toolset evolves. The prover is used in continuous integration, and is beginning to be used to verify contracts in production. As of this writing, the Move Prover hasn’t exposed any serious bugs. However, it has had an impact on how we understand code. An example is a function called

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figbz_HTML.gif

(a version with some specifications and comments omitted appears in Fig. 4). This function simply pays money from the account of the sender (who signed the transaction) to

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figca_HTML.gif

. In a previous version of the function, the Prover reported errors for two of the “obvious” specification properties shown. The first specification says that the function always aborts when paying zero Libra, because

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figcb_HTML.gif

aborts unless the amount is positive. However, in the earlier version,

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figcc_HTML.gif

handled the payment to deposit the amount in the account when the account did not yet exist, and that payment was allowed to be zero, violating the specification. The function was rewritten as it appears now, so that the same deposit code is called regardless of whether the payee account was newly created. The last specification says that the payer’s account decreases by

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figcd_HTML.gif

after a successful payment. This condition was violated when the payer and payee were the same, resulting in no decrease. Adding an assert (marked “new!” in the figure) to abort in that useless case makes the specification simpler.

The only other formal verification framework for Move that we are aware of is described in [36], where a high-level approach and some case studies are described, but no implementation details are provided.

The closest work in the literature has been done in the context of verification of solidity smart contracts using Boogie. VERISOL [22] is one tool which formally verifies solidity smart contracts via a translation to Boogie. Its specification language is designed for the specific context of application policies, but general specifications can be given by using solidity assertions. SOLC-VERIFY [19, 20] also uses Boogie to perform formal verification for solidity. It includes an annotation-based specification language and supports a larger feature-set of solidity than VERISOL. Interestingly, the formalization of the solidity persistent memory model presented in [20] is similar to our tree-based memory model for Move, though they were developed independently. One novelty of our model in comparison to theirs is its ability to handle generic functions as discussed in Sect. 4 (generics are supported in Move but not in solidity). Both VERISOL and SOLC-VERIFY target contracts written in solidity, and not in the Ethereum bytecode. In contrast, the Move Prover operates on the Move bytecode.

The solidity compiler itself includes a formal verification framework that works via a direct translation to SMT [2]. Several other tools have focused on specific vulnerability patterns, rather than user-defined specifications [16, 28, 34, 40]. Other theoretical foundations have also been employed for the verification of solidity smart contracts. These include the \(\mathbb {K}\) framework [35] (see, e.g., [21]), F* [29] (see, e.g., [11, 18]), and proof assistants such as Coq [37] (see, e.g., [42, 43]).

Formal verification of Rust [30] programs is also related to the Move Prover, as Move ’s type system has similar characteristics to Rust [30]. Prusti [4] is a tool that leverages Rust ’s type system information to verify Rust programs. It is based on a higher-level intermediate framework called Viper [33] (that internally uses Boogie in some scenarios). Other verification efforts for Rust employ a translation to LLVM and then leverage LLVM-based verification techniques (see, e.g., [6, 27, 39]).

8 Conclusion

In this paper, we introduced the Move Prover, a formal verification tool designed to be an integral part of the process of smart contract development for the Libra platform. Though our initial experience with the Move Prover is positive, there are many avenues for future work that we plan to pursue.

As Move continues to evolve, we expect that some constructs may be easier and more efficient to model by using custom SMT constructs. An example of this is the built-in vector type. Our current model requires the use of quantifiers to compare two vector objects. However, an SMT theory of sequences could be used to model vectors without needing to use quantifiers to define equality. We plan to investigate the use of richer (and possibly custom) SMT theories in our model.

The specifications we have written so far are local in the sense that they deal with only a single execution of a single Move function. However, some properties of the Libra blockchain are inherently global in nature, such as the fact that the total amount of currency should remain constant. We plan to investigate techniques for creating and checking such global specifications.

The current Prover is still in a prototype phase. But the goal is for it to be a product that is usable by everyone who is writing contracts for the Libra platform. We expect that there will be many challenges in producing a user-friendly, industrial-strength tool, but we also look forward to a future where formal specification and verification is a routine part of the development process for Move modules on the Libra blockchain.

Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

Vorheriges Kapitel Formal Analysis and Redesign of a Neural Network-Based Aircraft Taxiing System with VerifAI

Nächstes Kapitel End-to-End Formal Verification of Ethereum 2.0 Deposit Smart Contract

This paper reflects the state of the Move Prover at github commit https://github.com/libra/libra/tree/6798b1cd50ac7d524d3e494783910b3d7e827eef.

As with most verification approaches based on generating verification conditions, verifying recursive procedures or loops in Boogie requires writing loop invariants, which can be difficult and may also introduce quantifiers, making the problem harder for the underlying SMT solver. We have avoided this so far by relying on bounded iteration, but our roadmap includes full handling of recursion and loops via loop invariants.

Two additional functions in LibraAccount are “native” which means that they are built-in and don’t have any Move code. These are modeled directly in Boogie and are not included in the count here.

To reproduce, run

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figbw_HTML.gif

from

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figbx_HTML.gif

in the

https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-53288-8_7/MediaObjects/495638_1_En_7_Figby_HTML.gif

source tree.

Boogie. https://github.com/boogie-org/boogie

Alt, L., Reitwiessner, C.: SMT-based verification of solidity smart contracts. In: Margaria, T., Steffen, B. (eds.) ISoLA 2018. LNCS, vol. 11247, pp. 376–388. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-03427-6_28CrossRef

Amsden, Z., et al.: The Libra Blockchain (2019). https://developers.libra.org/docs/the-libra-blockchain-paper

Astrauskas, V., Müller, P., Poli, F., Summers, A.J.: Leveraging rust types for modular specification and verification. PACMPL 3(OOPSLA), 147:1–147:30 (2019)

Atzei, N., Bartoletti, M., Cimoli, T.: A survey of attacks on ethereum smart contracts (SoK). In: Maffei, M., Ryan, M. (eds.) POST 2017. LNCS, vol. 10204, pp. 164–186. Springer, Heidelberg (2017). https://doi.org/10.1007/978-3-662-54455-6_8CrossRef

Baranowski, M., He, S., Rakamarić, Z.: Verifying rust programs with SMACK. In: Lahiri, S.K., Wang, C. (eds.) ATVA 2018. LNCS, vol. 11138, pp. 528–535. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01090-4_32CrossRef

Barnett, M., Chang, B.-Y.E., DeLine, R., Jacobs, B., Leino, K.R.M.: Boogie: a modular reusable verifier for object-oriented programs. In: de Boer, F.S., Bonsangue, M.M., Graf, S., de Roever, W.-P. (eds.) FMCO 2005. LNCS, vol. 4111, pp. 364–387. Springer, Heidelberg (2006). https://doi.org/10.1007/11804192_17CrossRef

Barnett, M., Leino, K.R.M.: Weakest-precondition of unstructured programs. In: Proceedings of the 6th ACM SIGPLAN-SIGSOFT Workshop on Program Analysis for Software Tools and Engineering, pp. 82–87. Association for Computing Machinery, New York (2005). https://doi.org/10.1145/1108792.1108813

Barrett, C., et al.: CVC4. In: Gopalakrishnan, G., Qadeer, S. (eds.) CAV 2011. LNCS, vol. 6806, pp. 171–177. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-22110-1_14CrossRef

10.

Barrett, C., Stump, A., Tinelli, C.: The SMT-LIB standard: version 2.0. In: Gupta, A., Kroening, D. (eds.) Proceedings of the 8th International Workshop on Satisfiability Modulo Theories, Edinburgh, UK (2010)

11.

Bhargavan, K., et al.: Formal verification of smart contracts: short paper. In: PLAS@CCS, pp. 91–96. ACM (2016)

12.

Blackshear, S., et al.: Move: A language with programmable resources (2019). https://developers.libra.org/docs/move-paper

13.

Blackshear, S., et al.: Resources: A safe language abstraction for money (2020). https://arxiv.org/abs/2004.05106

14.

Buterin, V.: Critical update re DAO (2016). https://ethereum.github.io/blog/2016/06/17/critical-update-re-dao-vulnerability

15.

Chen, H., Pendleton, M., Njilla, L., Xu, S.: A survey on Ethereum systems security: vulnerabilities, attacks and defenses. CoRR abs/1908.04507 (2019)

16.

ConsenSys: Mythril Classic: Security analysis tool for Ethereum smart contracts. https://github.com/skylightcyber/mythril-classic

17.

Girard, J.: Linear logic. Theor. Comput. Sci. 50(1), 1–101 (1987)MathSciNetCrossRef

18.

Grishchenko, I., Maffei, M., Schneidewind, C.: A semantic framework for the security analysis of Ethereum smart contracts. In: Bauer, L., Küsters, R. (eds.) POST 2018. LNCS, vol. 10804, pp. 243–269. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-89722-6_10CrossRef

19.

Hajdu, Á., Jovanovic, D.: solc-verify: A modular verifier for solidity smart contracts. CoRR abs/1907.04262 (2019)

20.

Hajdu, Á., Jovanović, D.: SMT-friendly formalization of the solidity memory model. ESOP 2020. LNCS, vol. 12075, pp. 224–250. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-44914-8_9CrossRef

21.

Hildenbrandt, E., et al.: KEVM: a complete formal semantics of the Ethereum virtual machine. In: CSF, pp. 204–217. IEEE Computer Society (2018)

22.

Lahiri, S.K., Chen, S., Wang, Y., Dillig, I.: Formal specification and verification of smart contracts for azure blockchain. CoRR abs/1812.08829 (2018)

23.

Leino, K.R.M.: This is boogie 2 (2008). https://www.microsoft.com/en-us/research/publication/this-is-boogie-2-2/, manuscript KRML 178

24.

Leino, K.R.M., Rümmer, P.: A polymorphic intermediate verification language: design and logical encoding. In: Esparza, J., Majumdar, R. (eds.) TACAS 2010. LNCS, vol. 6015, pp. 312–327. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-12002-2_26CrossRefMATH

25.

Libra. https://github.com/libra/libra

26.

Lindholm, T., Yellin, F.: The Java Virtual Machine Specification. Addison-Wesley, Reading (1997)

27.

Lindner, M., Aparicius, J., Lindgren, P.: No panic! verification of rust programs by symbolic execution. In: INDIN, pp. 108–114. IEEE (2018)

28.

Luu, L., Chu, D., Olickel, H., Saxena, P., Hobor, A.: Making smart contracts smarter. In: ACM Conference on Computer and Communications Security, pp. 254–269. ACM (2016)

29.

Maillard, K., et al.: Dijkstra monads for all. In: 24th ACM SIGPLAN International Conference on Functional Programming (ICFP) (2019). https://arxiv.org/abs/1903.01237

30.

Matsakis, N.D., Klock II, F.S.: The rust language. Ada Lett. 34(3), 103–104 (2014). https://doi.org/10.1145/2692956.2663188

31.

Meijer, E., Wa, R., Gough, J.: Technical overview of the common language runtime (2000)

32.

de Moura, L., Bjørner, N.: Z3: an efficient SMT solver. In: Ramakrishnan, C.R., Rehof, J. (eds.) TACAS 2008. LNCS, vol. 4963, pp. 337–340. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-78800-3_24CrossRef

33.

Müller, P., Schwerhoff, M., Summers, A.J.: Viper: a verification infrastructure for permission-based reasoning. In: Dependable Software Systems Engineering, NATO Science for Peace and Security Series - D: Information and Communication Security, vol. 50, pp. 104–125. IOS Press (2017)

34.

Nikolic, I., Kolluri, A., Sergey, I., Saxena, P., Hobor, A.: Finding the greedy, prodigal, and suicidal contracts at scale. In: ACSAC, pp. 653–663. ACM (2018)

35.

Rosu, G., Serbanuta, T.: An overview of the K semantic framework. J. Log. Algebr. Program. 79(6), 397–434 (2010)MathSciNetCrossRef

36.

Synthetic Minds Blog: Verifying smart contracts in the move language (2019). https://synthetic-minds.com/pages/blog/blog-2019-09-11.html

37.

The Coq development team: The coq proof assistant reference manual version 8.9 (2019). https://coq.inria.fr/distrib/current/refman/

38.

The Libra Association: An Introduction to Libra (2019). https://libra.org/en-us/whitepaper

39.

Toman, J., Pernsteiner, S., Torlak, E.: Crust: a bounded verifier for rust (N). In: ASE, pp. 75–80. IEEE Computer Society (2015)

40.

Tsankov, P., Dan, A.M., Drachsler-Cohen, D., Gervais, A., Bünzli, F., Vechev, M.T.: Securify: practical security analysis of smart contracts. In: ACM Conference on Computer and Communications Security, pp. 67–82. ACM (2018)

41.

Wood, G.: Ethereum: a secure decentralised generalised transaction ledger (2014). https://ethereum.github.io/yellowpaper/paper.pdf

42.

Yang, Z., Lei, H.: Formal process virtual machine for smart contracts verification. CoRR abs/1805.00808 (2018)

43.

Yang, Z., Lei, H.: Fether: an extensible definitional interpreter for smart-contract verifications in Coq. IEEE Access 7, 37770–37791 (2019)CrossRef

Titel: The Move Prover
verfasst von: Jingyi Emma Zhong
Kevin Cheang
Shaz Qadeer
Wolfgang Grieskamp
Sam Blackshear
Junkil Park
Yoni Zohar
Clark Barrett
David L. Dill
Verlag: Springer International Publishing
Buch: Computer Aided Verification
Print ISBN: 978-3-030-53287-1

Electronic ISBN: 978-3-030-53288-8

Copyright-Jahr: 2020
DOI: https://doi.org/10.1007/978-3-030-53288-8_7

Springer Professional

The Move Prover

Abstract

1 Introduction

2 Background: The Move Language

3 Tool Overview

4 Boogie Model

5 Specifications

6 Evaluation

8 Conclusion

Premium Partner

Springer Professional

Abstract

1 Introduction

2 Background: The Move Language

3 Tool Overview

4 Boogie Model

5 Specifications

6 Evaluation

7 Related Work

8 Conclusion

Premium Partner