In and Out: Optimizing Overall Interaction in Probabilistic Graphs under Clustering Constraints

Overview

This project is developed as part of the following research paper:

D. Mandaglio, A. Tagarelli, F. Gullo (2020). In and Out: Optimizing Overall Interaction in Probabilistic Graphs under Clustering Constraints. In Procs. of the 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, August 23rd - August 27th, 2020, CA, USA. DOI: https://doi.org/10.1145/3394486.3403190

Please cite the above paper in any research publication you may produce using this code or data/analysis derived from it.

Folders

datasets: contains the original data as well as the preprocessed data (as described in the paper). Biggest networks (to be unzipped in datasets folder) can be downloaded at the following link
code: it contains this project code and links to the competing methods
output: it stores all results produced by the algorithms

Usage

From the folder 'optimize_interactions/code', run the following command:

run_offline_algorithm.py [-h] -d DATASET -a {MIL,D-MIL} [-s SEED]
                                [-r RUNS] [--hill_climbing [HILL_CLIMBING]]
                                [-I HILL_CLIMBING_ITERS]
                                [--save_only_avg [SAVE_ONLY_AVG]]

Dependencies

numpy
python-igraph

Positional arguments

  -d DATASET, --dataset DATASET
                        Input dataset, whose name identifies a particular subfolder in 'datasets/'
  -a {MIL,D-MIL}, --alg {MIL,D-MIL}
                        Selected algorithm

Optional arguments

  -s SEED, --seed SEED  Random generation seed -- for reproducibility (default value 100)
  -r RUNS, --runs RUNS  Number of runs of the selected randomized algorithm (default value 50) 
  --hill_climbing [HILL_CLIMBING]
                        It enables the hill climbing postprocessing step
  -I HILL_CLIMBING_ITERS, --hill_climbing_iters HILL_CLIMBING_ITERS
                        Maximum number of iterations for the hill climbing
                        algorithm (default value 8)
  --save_only_avg [SAVE_ONLY_AVG]
                        if True (default value), only average results across several runs are
                        stored, otherwise the results for each run is saved in a specific sub-
                        folder in 'output/'

Default values for -s, -r and -I correspond to the settings relating to the results presented in the paper.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
code		code
datasets		datasets
output		output
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

In and Out: Optimizing Overall Interaction in Probabilistic Graphs under Clustering Constraints

Overview

Folders

Usage

Dependencies

Positional arguments

Optional arguments

About

Releases

Packages

Languages

License

Ralyhu/optimize_interactions

Folders and files

Latest commit

History

Repository files navigation

In and Out: Optimizing Overall Interaction in Probabilistic Graphs under Clustering Constraints

Overview

Folders

Usage

Dependencies

Positional arguments

Optional arguments

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages