Random Interpretation (Random Testing + Abstract Interpretation)

Established: December 1, 2008

Randomized Algorithms for Formal Verification

Powerpoint Slides on Random Interpretation

Introduction

A sound and complete program analysis is provably hard. We typically pay a price for the hardness of program analysis in terms of having an incomplete (i.e. conservative) analysis, or by having algorithms that are complicated and have long running-time. It is interesting to consider if we can pay for this hardness by compromising soundness a little bit (and thus benefit by having simpler and more efficient and complete algorithms). By compromising soundness, we mean that judgements may be unsound, but with very low probability. Furthermore, we can predict and control this error probability and make it so small that for practical purposes, it does not hurt at all. We can make the error probability smaller than the probability of a hardware error in a computer, or the probability of a meteorite striking one’s computer in the next microsecond. We have developed a set of randomized algorithms for program analysis, which are more efficient and precise than their deterministic counterparts. These algorithms are quite simple, however their probabilistic soundness proofs have been the challenging part. These algorithms seem to have a common theme, which we call random interpretation.

Random interpretation is similar to abstract interpretation, which is a well-known technique for program analysis. The key idea in random interpretation is to execute the code fragment on a few random inputs in a contrived manner, which includes giving random linear interpretations to the operators in the program. Both branches of a conditional are executed on each run and at joint points we perform a random affine combination of the joining states. In the branches of an equality conditional we adjust the data values on the fly to reflect the truth value of the guarding boolean expression.

We have described random interpretation schemes for the theory of linear arithmetic and for the theory of uninterpreted functions. Both these random interpretation schemes have a better computational complexity and are simpler to implement than the corresponding abstract interpretation schemes. These random interpretation schemes are intraprocedural. Recently, we have described a generic technique to extend any intraprocedural random interpretation scheme to perform an interprocedural analysis (essentially by computing random summaries of the procedures). The next step is to come up with a random interpretation scheme for the combined theory of linear arithmetic and uninterpreted function symbols. More generally, we are working on a generic strategy to combine any two random interpretation schemes.

Can we benefit by applying similar ideas to other program analysis problems and decision procedures for other theories?

- Random interpretation (Phd Dissertation, 2005, Winner of ACM SIGPLAN Outstanding Doctoral Dissertation Award)
- Random interpretation for linear arithmetic (POPL 2003)
- Random intepretation for uninterpreted operators (POPL 2004)
- Random interpretation for interprocedural analysis (POPL 2005)
Opens in a new tab
- Randomized decision procedure (for the uniform word problem) for linear arithmetic and uninterpreted operators (CADE 2003)
Opens in a new tab
- Randomized path-sensitive analysis for linear arithmetic and uninterpreted operators (SAS 2004)
Opens in a new tab
- Randomized Algorithms (opens in new tab), Rajeev Motwani and Prabhakar Raghavan, Cambridge University Press, June 1995
- On Randomization in Sequential and Distributed Algorithms (opens in new tab), Rajiv Gupta, Scott A. Smolka and Shaji Bhaskar, ACM Computing Surveys, Volume 26, Issue 1, March 1994
Opens in a new tab
- George Necula (opens in new tab)
Opens in a new tab

People

Sumit Gulwani

Distinguished Scientist and Vice President

Learn more