CodeFlow: Predicting Program Behavior with Dynamic Dependencies Learning

Introduction

We introduce CodeFlow, a novel machine learning-based approach designed to predict program behavior by learning both static and dynamic dependencies within the code. CodeFlow constructs control flow graphs (CFGs) to represent all possible execution paths and uses these graphs to predict code coverage and detect runtime errors. Our empirical evaluation demonstrates that CodeFlow significantly improves code coverage prediction accuracy and effectively localizes runtime errors, outperforming state-of-the-art models.

Paper: CodeFlow: Predicting Program Behavior with Dynamic Dependencies Learning

Installation

To set up the environment and install the necessary libraries, run the following command:

./setup.sh

Architecture

CodeFlow consists of several key components:

CFG Building: Constructs CFGs from the source code.
Source Code Representation Learning: Learns vector representations of CFG nodes.
Dynamic Dependencies Learning: Captures dynamic dependencies among statements using execution traces.
Code Coverage Prediction: Classifies nodes for code coverage using learned embeddings.
Runtime Error Detection and Localization: Detects and localizes runtime errors by analyzing code coverage continuity within CFGs.

Usage

Running CodeFlow Model

To run the CodeFlow model, use the following command:

python main.py --data <dataset> [--runtime_detection] [--bug_localization]

Configuration Options

--data: Specify the dataset to be used for training. Options:
- CodeNet: Train with only non-buggy Python code from the CodeNet dataset.
- FixEval_complete: Train with both non-buggy and buggy code from the FixEval and CodeNet dataset.
- FixEval_incomplete: Train with the incomplete version of the FixEval_complete dataset.
--runtime_detection: Validate the Runtime Error Detection.
--bug_localization: Validate the Bug Localization in buggy code.

Example Usage

Training with the CodeNet dataset(RQ1):
```
python main.py --data CodeNet
```
Training with the complete FixEval dataset and validating Runtime Error Detection(RQ2):
```
python main.py --data FixEval_complete --runtime_detection
```

Training with the complete and incomplete FixEval dataset and validating Bug Localization(RQ3):

python main.py --data FixEval_complete --bug_localization
python main.py --data FixEval_incomplete --bug_localization

Fuzz Testing with LLM Integration (RQ4)

After training CodeFlow and saving the corresponding checkpoint, you can utilize it for fuzz testing by integrating it with a Large Language Model (LLM). Use the following command:

python fuzz_testing.py --checkpoint <number> --epoch <number> --time <seconds> --claude_api_key <api_key> --model <model_name>

checkpoint: The chosen checkpoint.
epoch: The chosen epoch of checkpoint.
time: Time in seconds to run fuzz testing for each code file.
claude_api_key: Your API key for Claude.
model: Model of Claude, default is claude-3-5-sonnet-20240620.

Example

python fuzz_testing.py --checkpoint 1 --epoch 600 --time 120 --claude_api_key YOUR_API_KEY --model claude-3-5-sonnet-20240620

Generating Your Own Dataset

To generate your own dataset, including CFG, forward and backward edges, and the true execution trace as ground truth for your Python code, follow these steps:

Navigate to the generate_dataset folder:
```
cd generate_dataset
```
Place your Python code files in the dataset folder.
Run the dataset generation script:
```
python generate_dataset.py
```

To build and visualize CFG for a Python file, use this command:

python cfg.py \directory_to_Python_file

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgements

This codebase is adapted from:

Citation Information

If you're using CodeFlow, please cite using this BibTeX:

@misc{le2024learningpredictprogramexecution,
      title={Learning to Predict Program Execution by Modeling Dynamic Dependency on Code Graphs}, 
      author={Cuong Chi Le and Hoang Nhat Phan and Huy Nhat Phan and Tien N. Nguyen and Nghi D. Q. Bui},
      year={2024},
      eprint={2408.02816},
      archivePrefix={arXiv},
      primaryClass={cs.SE},
      url={https://arxiv.org/abs/2408.02816}, 
}

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
data		data
fuzz_testing_dataset		fuzz_testing_dataset
generate_dataset		generate_dataset
img		img
README.md		README.md
cfg.py		cfg.py
config.py		config.py
data.py		data.py
fuzz_testing.py		fuzz_testing.py
main.py		main.py
model.py		model.py
requirements.txt		requirements.txt
setup.sh		setup.sh
trace_execution.py		trace_execution.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CodeFlow: Predicting Program Behavior with Dynamic Dependencies Learning

Introduction

Paper: CodeFlow: Predicting Program Behavior with Dynamic Dependencies Learning

Installation

Architecture

Usage

Running CodeFlow Model

Configuration Options

Example Usage

Fuzz Testing with LLM Integration (RQ4)

Example

Generating Your Own Dataset

License

Acknowledgements

Citation Information

About

Languages

FSoft-AI4Code/CodeFlow

Folders and files

Latest commit

History

Repository files navigation

CodeFlow: Predicting Program Behavior with Dynamic Dependencies Learning

Introduction

Paper: CodeFlow: Predicting Program Behavior with Dynamic Dependencies Learning

Installation

Architecture

Usage

Running CodeFlow Model

Configuration Options

Example Usage

Fuzz Testing with LLM Integration (RQ4)

Example

Generating Your Own Dataset

License

Acknowledgements

Citation Information

About

Topics

Resources

Stars

Watchers

Forks

Languages