Could a Machine ever write tests for our code

In this talk presented at RubyConf 2012, Loren Segal discusses the potential for machines to automate the writing of tests for Ruby code through formal verification techniques. The session centers around a research project called RubyCorrect, which integrates advanced verification methods into the Ruby programming language, focusing on two primary tools: Ruby ESC (Extended Static Checking) and Ruby Kon (Symbolic Execution).

Key Points:

Formal Verification Overview: The speaker introduces formal verification as a methodology that uses mathematical logic and theorem proving to establish program correctness, acknowledging that its details are dry but essential. The methods discussed include static and runtime checks, with Extended Static Checking (ESC) and Symbolic Execution as the focus of the talk.
Extended Static Checking (ESC): Segal explains how Ruby ESC translates Ruby methods into logical expressions. It verifies code correctness based on specified preconditions and postconditions, utilizing Boolean algebra. As an application, he walks through a Fibonacci sequence example, demonstrating how Ruby ESC identifies precondition violations when changes are made to the code.
Symbolic Execution: In contrast to ESC, which requires contracts to be written before it can check anything, symbolic execution operates without them, which makes it more flexible. This method analyzes Ruby code by substituting symbolic values for concrete ones and automatically generates test cases by exploring the different execution paths.
Demonstrations: Segal provides demonstrations of both Ruby ESC and symbolic execution, showing how each tool analyzes and validates code. He generates tests for the Fibonacci sequence using symbolic execution, illustrating how it derives inputs that exercise different execution paths.
Limitations and Future Work: The speaker candidly discusses limitations of the current Ruby ESC implementation, including incomplete coverage of Ruby's standard library and challenges with dynamic code. For symbolic execution, he acknowledges the difficulty in statically analyzing certain Ruby features and the reliance on annotations. He suggests future directions for improving these tools, including better type inference, implementing Ruby abstractions in Myra, and writing a dedicated symbolic execution engine for Ruby.
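
To make the contract idea behind ESC concrete, here is a minimal Ruby sketch of a precondition and postcondition on a Fibonacci method. It is an illustration under stated assumptions: `with_contract`, `ContractViolation`, and `fib_broken` are hypothetical names, not RubyCorrect's annotation syntax, and the checks here run at call time, whereas Ruby ESC as described in the talk reasons about them statically.

```ruby
# Hypothetical names throughout; this is not RubyCorrect's API.
class ContractViolation < StandardError; end

# Runs the block only if the precondition holds, then checks the postcondition.
def with_contract(arg, pre:, post:)
  raise ContractViolation, "precondition failed for #{arg.inspect}" unless pre.call(arg)
  result = yield(arg)
  raise ContractViolation, "postcondition failed for #{arg.inspect}" unless post.call(arg, result)
  result
end

NON_NEGATIVE_INT = ->(x) { x.is_a?(Integer) && x >= 0 }
NON_NEGATIVE_RESULT = ->(_x, r) { r >= 0 }

# Correct version: the base case stops the recursion before n goes negative.
def fib(n)
  with_contract(n, pre: NON_NEGATIVE_INT, post: NON_NEGATIVE_RESULT) do |x|
    x < 2 ? x : fib(x - 1) + fib(x - 2)
  end
end

# Changed version: the base case is now `x < 1`, so fib_broken(1)
# recurses into fib_broken(-1) and violates the precondition.
def fib_broken(n)
  with_contract(n, pre: NON_NEGATIVE_INT, post: NON_NEGATIVE_RESULT) do |x|
    x < 1 ? x : fib_broken(x - 1) + fib_broken(x - 2)
  end
end

puts fib(10) # prints 55

begin
  fib_broken(1)
rescue ContractViolation => e
  puts e.message # prints "precondition failed for -1"
end
```

A runtime check like this only reports the violation when a failing input is actually executed. The appeal of static checking is that the same violation could be reported from the method body and its contract alone, without running the code.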
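
The path-exploration idea behind symbolic execution can be sketched in a few lines of Ruby. This is a toy under stated assumptions, not the engine used in the talk: `SymInt`, `explore`, and `classify` are hypothetical names, only the `<` comparison against integer constants is supported, and path conditions are "solved" by brute-force search over a small range rather than by a real constraint solver. `classify` mirrors the branch structure of a Fibonacci method (invalid input, base case, recursive case).

```ruby
# Toy symbolic integer: a comparison does not inspect a concrete value.
# It follows a scheduled branch decision and records it as a constraint.
class SymInt
  attr_reader :trace

  def initialize(prefix, worklist)
    @prefix = prefix.dup # branch decisions to replay on this run
    @worklist = worklist # unexplored decision prefixes
    @trace = []          # [[bound, outcome], ...] = the path condition
  end

  def <(bound)
    if @prefix.empty?
      # New branch point: take the true side now, schedule the false side.
      outcome = true
      @worklist << (@trace.map(&:last) + [false])
    else
      outcome = @prefix.shift
    end
    @trace << [bound, outcome]
    outcome
  end
end

# Explores every path through the block and returns one [input, result]
# pair per feasible path.
def explore(candidates = (-5..5))
  worklist = [[]]
  cases = []
  until worklist.empty?
    sym = SymInt.new(worklist.shift, worklist)
    result = yield(sym)
    # Brute-force "solver": find a concrete value satisfying the path condition.
    input = candidates.find do |v|
      sym.trace.all? { |bound, outcome| (v < bound) == outcome }
    end
    cases << [input, result] unless input.nil? # skip infeasible paths
  end
  cases
end

# Hypothetical method under test.
def classify(n)
  if n < 0
    :negative
  elsif n < 2
    :base_case
  else
    :recursive_case
  end
end

generated = explore { |n| classify(n) }
generated.each { |input, result| puts "classify(#{input}) => #{result}" }
# prints:
# classify(-5) => negative
# classify(0) => base_case
# classify(2) => recursive_case
```

Each generated pair can be turned into a test case, one per execution path, without anyone having written a contract first. A real engine would also need to handle arithmetic on symbolic values, loops, and recursion, which this sketch deliberately leaves out.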

Conclusions:

The key takeaway from the presentation is that some aspects of test generation for Ruby can feasibly be automated through formal verification techniques. However, while such tools can support developers, they are not yet fully reliable and come with inherent limitations. The deeper question remains whether these automated systems can be fully adopted in practical development environments, particularly given the dynamic aspects of Ruby code.

Loren Segal • Denver, CO • Talk

Date: November 01, 2012
Published: March 19, 2013

TDD is a great way to test code, but have you ever wondered if there are ways to leverage the awesome power of computers and help us write better tests? Research in the field of formal verification has shown promising results with tools that analyze programs for logic errors and can even figure out what input values caused those failures. However, until now, none of that research was ever used with Ruby. This talk discusses RubyCorrect, a research project that attempts to apply verification techniques like "extended static checking" and "symbolic execution" to the world of Ruby programs. We look at how these techniques work and how they could potentially improve the kinds of program faults we can detect. Machines that write our tests? So crazy that it just might work!

RubyConf 2012