Archivetar

archivetar (V2) is a collection of tools intended to make archiving and using big data easier. It is targeted mostly at the research / HPC use case, but it is also useful wherever having fewer files, though not one gigantic file, is beneficial.

archivetar was written to make our Spectrum Archive (SA) installation, Data Den, more useful. It has no dependencies on SA and could easily be used with AWS Glacier, HPSS, DMF, etc., or with any service where you want to minimize the number of objects while keeping the data-to-object ratio high.

For additional performance, archivetar will detect many multi-core capable compressors.
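
As a rough illustration of what such detection could look like, the sketch below searches `PATH` for a few well-known parallel compressors and falls back to their single-threaded counterparts. The candidate list and the name `find_compressor` are assumptions made for this example, not archivetar's actual detection logic.

```python
import shutil

# Hypothetical preference order: parallel tools first, serial fallbacks last.
CANDIDATES = {
    "gzip": ["pigz", "gzip"],
    "bzip2": ["pbzip2", "lbzip2", "bzip2"],
    "xz": ["pixz", "xz"],
}


def find_compressor(fmt, which=shutil.which):
    """Return the path of the first available compressor for fmt, or None.

    `which` is injectable so the lookup can be exercised without the
    tools actually being installed.
    """
    for name in CANDIDATES.get(fmt, []):
        path = which(name)
        if path is not None:
            return path
    return None


if __name__ == "__main__":
    for fmt in CANDIDATES:
        print(fmt, "->", find_compressor(fmt))
```

Listing the parallel tool first means a multi-core compressor is picked whenever one is installed, while systems without one still get a working fallback.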

Example Output

# number of files before
$ find . -type f | wc -l
6656

# bundle all files < 1M, into tars 200M in size
# Delete input files
archivetar --prefix boxout --remove-files --size 1M --tar-size 200M

# number of files after
$ find . -type f | wc -l
1831

# expand using tar
for x in boxout*.tar; do
   tar -xf "$x"
done

# Alternative using provided unarchivetar
unarchivetar --prefix boxout
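
The selection and bundling step in the example above can be pictured roughly as follows: files below the size threshold are grouped into bundles whose total size stays within the target tar size, and larger files are left alone. This is a minimal sketch under assumed semantics. The function `plan_bundles` and its greedy packing are illustrative only and are not taken from archivetar's source.

```python
def plan_bundles(files, max_file_size, tar_size):
    """Greedily group small files into bundles of at most tar_size bytes.

    files: iterable of (path, size_in_bytes) pairs.
    Returns (bundles, skipped), where bundles is a list of lists of paths
    and skipped holds the paths whose size is >= max_file_size.
    """
    bundles, skipped = [], []
    current, current_size = [], 0
    for path, size in files:
        if size >= max_file_size:
            # Large files are not bundled; they stay as individual objects.
            skipped.append(path)
            continue
        # Close the current bundle if this file would push it past the target.
        if current and current_size + size > tar_size:
            bundles.append(current)
            current, current_size = [], 0
        current.append(path)
        current_size += size
    if current:
        bundles.append(current)
    return bundles, skipped


if __name__ == "__main__":
    demo = [("a", 40), ("b", 70), ("big", 500), ("c", 30), ("d", 60)]
    print(plan_bundles(demo, max_file_size=100, tar_size=100))
```

With real data, `max_file_size` and `tar_size` would correspond to the byte values behind options such as `--size 1M` and `--tar-size 200M`.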

archivetar vs tar

archivetar doesn't try to replace tar. In fact, it uses tar internally rather than Python's native implementation.
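
One way to picture driving the system tar from Python, instead of using the standard library `tarfile` module, is the sketch below. The helper names and the chosen flags are assumptions for illustration. `--use-compress-program` is a GNU tar option, and archivetar's real invocation may differ.

```python
import subprocess


def build_tar_command(archive, members, compress_program=None):
    """Build an argument list for creating `archive` from `members`."""
    cmd = ["tar", "--create", "--file", archive]
    if compress_program:
        # Hand compression to an external (possibly multi-core) program.
        cmd.append(f"--use-compress-program={compress_program}")
    cmd.append("--")  # end of options, so odd file names are not parsed as flags
    cmd.extend(members)
    return cmd


def create_archive(archive, members, compress_program=None):
    """Run tar and raise CalledProcessError if it exits non-zero."""
    subprocess.run(build_tar_command(archive, members, compress_program), check=True)


if __name__ == "__main__":
    print(build_tar_command("bundle-1.tar.gz", ["a.txt", "b.txt"], "pigz"))
```

Keeping command construction separate from execution makes the invocation easy to inspect or log before anything is written to disk.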

Usage

See USAGE.md

Building archivetar

See INSTALL.md
