Commit graph

1101 commits

Author SHA1 Message Date
wiiznokes
591ce5d044
fix: remove executable bit (#238)
Co-authored-by: Chris Hennick <4961925+Pr0methean@users.noreply.github.com>
2024-11-19 19:56:41 +00:00
Radmir
ff877df425
fix(lzma): fixed panic in case of invalid lzma stream (#259)
Co-authored-by: Chris Hennick <4961925+Pr0methean@users.noreply.github.com>
2024-11-19 19:55:47 +00:00
ardurin
0ea2744c89
Add function to raw copy files and update metadata (#260)
Co-authored-by: Chris Hennick <4961925+Pr0methean@users.noreply.github.com>
2024-11-19 19:54:32 +00:00
David Caldwell
73143a0ad6
perf: Faster cde rejection (#255)
* Use the tempfile crate instead of the tempdir crate (which is deprecated)

https://github.com/rust-lang-deprecated/tempdir?tab=readme-ov-file#deprecation-note

* perf: Add benchmark that measures the rejection speed of a large non-zip file

* perf: Speed up non-zip rejection by increasing END_WINDOW_SIZE

I tested several END_WINDOW_SIZEs across 2 machines:

Machine 1: macOS 15.0.1, aarch64 (apfs /tmp)
512:   test parse_large_non_zip  ... bench:  30,450,608 ns/iter (+/- 673,910)
4096:  test parse_large_non_zip  ... bench:   7,741,366 ns/iter (+/- 521,101)
8192:  test parse_large_non_zip  ... bench:   5,807,443 ns/iter (+/- 546,227)
16384: test parse_large_non_zip  ... bench:   4,794,314 ns/iter (+/- 419,114)
32768: test parse_large_non_zip  ... bench:   4,262,897 ns/iter (+/- 397,582)
65536: test parse_large_non_zip  ... bench:   4,060,847 ns/iter (+/- 280,964)

Machine 2: Debian testing, x86_64 (tmpfs /tmp)
512:   test parse_large_non_zip  ... bench:  65,132,581 ns/iter (+/- 7,429,976)
4096:  test parse_large_non_zip  ... bench:  14,109,503 ns/iter (+/- 2,892,086)
8192:  test parse_large_non_zip  ... bench:   9,942,500 ns/iter (+/- 1,886,063)
16384: test parse_large_non_zip  ... bench:   8,205,851 ns/iter (+/- 2,902,041)
32768: test parse_large_non_zip  ... bench:   7,012,011 ns/iter (+/- 2,222,879)
65536: test parse_large_non_zip  ... bench:   6,577,275 ns/iter (+/- 881,546)

In both cases END_WINDOW_SIZE=8192 performed about 6x better than 512 and >8192
didn't make much of a difference on top of that.

* perf: Speed up non-zip rejection by limiting search for EOCDR.

I benchmarked several search sizes across 2 machines
(these benches are using an 8192 END_WINDOW_SIZE):

Machine 1: macOS 15.0.1, aarch64 (apfs /tmp)
whole file:   test parse_large_non_zip              ... bench:   5,773,801 ns/iter (+/- 411,277)
last 128k:    test parse_large_non_zip              ... bench:      54,402 ns/iter (+/- 4,126)
last 66,000:  test parse_large_non_zip              ... bench:      36,152 ns/iter (+/- 4,293)

Machine 2: Debian testing, x86_64 (tmpfs /tmp)
whole file:   test parse_large_non_zip              ... bench:   9,942,306 ns/iter (+/- 1,963,522)
last 128k:    test parse_large_non_zip              ... bench:      73,604 ns/iter (+/- 16,662)
last 66,000:  test parse_large_non_zip              ... bench:      41,349 ns/iter (+/- 16,812)

As you might expect these significantly increase the rejection speed for
large non-zip files.

66,000 was the number previously used by zip-rs. It was changed to zero in
7a55945743.

128K is what Info-Zip uses[1]. This seems like a reasonable (non-zero)
choice for compatibility reasons.

[1] Info-zip is extremely old and doesn't not have an official git repo to
    link to. However, an unofficial fork can be found here:
    bb0c4755d4/zipfile.c (L4073)

---------

Co-authored-by: Chris Hennick <4961925+Pr0methean@users.noreply.github.com>
2024-11-19 18:49:35 +00:00
Chris Hennick
8abbf0e931
ci(fuzz): Switch to alf for faster fuzzing (#245)
* test(fuzz): Migrate to afl++ for fuzzing

* build: Exclude new fuzz binaries

* chore: Fix new warning

* ci: Use cargo action for format check

* deps: Update constant_time_eq and flate2

* ci: Bug fix for file paths

* ci: Bug fix: working directory is parent of repository root

* ci: Bug fix: remove stray `cd` commands

* ci: Bug fix? Make paths explicitly descend from workspace root

* ci: Bug fix? Assume github.workspace is the repo root

* test(fuzz): Commit files that were previously missing

* ci(fuzz): Bug fix for fuzz_write_with_no_features

* ci(fuzz): Bug fix: no -V arg for cmin

* ci(fuzz): Bug fix: no -a arg for cmin

* Bug fix: replace colons with dashes in filenames

* style: Fix 2 clippy warnings

* style: Fix another clippy warning in some configs

* ci(fuzz): Enable renaming in all fuzz jobs

* ci(fuzz): Fix: need to rename files in multiple dirs

Signed-off-by: Chris Hennick <4961925+Pr0methean@users.noreply.github.com>

* ci(fuzz): Install `rename` tool

* ci(fuzz): Fix redundant steps and too-late install of `rename`

* ci(fuzz): fix? replace multiple colons

---------

Signed-off-by: Chris Hennick <4961925+Pr0methean@users.noreply.github.com>
2024-11-19 14:25:55 +00:00
Richard Ivánek
1f2957db1f
fix: resolve new clippy warnings on nightly (#262) 2024-11-18 22:31:26 +00:00
Richard Ivánek
06632924e8
fix: resolve clippy warning in nightly (#252) 2024-10-21 05:05:49 +00:00
nick
af33ed343d
feat: Expose ZipArchive::central_directory_start (#232) 2024-08-11 12:00:08 +00:00
Shun Sakai
3f6768ec5a
docs: Update list of supported features (#230) 2024-08-05 17:15:45 +00:00
Chris Hennick
f803fa0197
test: (#33) Verify that data_start is correct when reading an alignment-padded file (#228)
* test: Add test that `data_start` is correctly detected while reading

* chore: Fix imports
2024-07-29 16:23:32 +00:00
Chris Hennick
6d8ab6224b
fix: (#33) Rare combination of settings could lead to writing a corrupt archive with overlength extra data, and data_start locations when reading the archive back were also wrong (#221)
* fix: Rare combination of settings could lead to writing a corrupt archive with overlength extra data

* fix: Previous fix was breaking alignment

* style: cargo fmt --all

* fix: ZIP64 header was being written twice

* style: cargo fmt --all

* ci(fuzz): Add check that file-creation options are individually valid

* fix: Need to update extra_data_start in deep_copy_file

* style: cargo fmt --all

* test(fuzz): fix bug in Arbitrary impl

* fix: Cursor-position bugs when merging archives or opening for append

* fix: unintended feature dependency

* style: cargo fmt --all

* fix: merge_contents was miscalculating new start positions for absorbed archive's files

* fix: shallow_copy_file needs to reset CDE location since the CDE is copied

* fix: ZIP64 header was being written after AES header location was already calculated

* fix: ZIP64 header was being counted twice when writing extra-field length

* fix: deep_copy_file was positioning cursor incorrectly

* test(fuzz): Reimplement Debug so that it prints the method calls actually made

* test(fuzz): Fix issues with `Option<&mut Formatter>`

* chore: Partial debug

* chore: Revert: `merge_contents` already adjusts header_start and data_start

* chore: Revert unused `mut`

* style: cargo fmt --all

* refactor: eliminate a magic number for CDE block size

* chore: WIP: fix bugs

* refactor: Minor refactors

* refactor: eliminate a magic number for CDE block size

* refactor: Minor refactors

* refactor: Can use cde_start_pos to locate ZIP64 end locator

* chore: Fix import that can no longer be feature-gated

* chore: Fix import that can no longer be feature-gated

* refactor: Confusing variable name

* style: cargo fmt --all and fix Clippy warnings

* style: fix another Clippy warning

---------

Signed-off-by: Chris Hennick <4961925+Pr0methean@users.noreply.github.com>
2024-07-29 02:24:07 +00:00
Chris Hennick
3ecd65176c
refactor: Eliminate some magic numbers and unnecessary path prefixes (#225)
* refactor: eliminate a magic number for CDE block size

* refactor: Minor refactors

* refactor: Can use cde_start_pos to locate ZIP64 end locator

* chore: Fix import that can no longer be feature-gated

* chore: Fix import that can no longer be feature-gated
2024-07-28 01:43:44 +00:00
Chris Hennick
a29b860395
test(fuzz): Make cargo fuzz fmt fuzz_write output more reliably equivalent to the code path it follows (#224) 2024-07-26 21:42:03 +00:00
Danny McClanahan
a7c1230dfa
publicly export and document the zip64 threshold constants (#79)
- add doctest for ZIP64_BYTES_THR
2024-07-20 01:52:06 +00:00
Chris Hennick
a60bd79826
Merge pull request #210 from a1phyr/multiple_refactors
Multiple refactors
2024-07-20 01:29:39 +00:00
Chris Hennick
7471cf526f
refactor: change invalid_state() return type to io::Result<T>
Signed-off-by: Chris Hennick <4961925+Pr0methean@users.noreply.github.com>
2024-07-19 13:32:23 -07:00
Chris Hennick
c14986806a Fix divergence from origin/master 2024-07-18 21:02:19 +00:00
Chris Hennick
81b5fb6088 Update fuzz_write corpus to use only smaller entries 2024-07-18 21:02:16 +00:00
Chris Hennick
6106a2bf0b
Merge pull request #201 from nichmor/fix/soft-links-should-remain-the-same
fix: soft links should remain the same
2024-07-18 17:26:17 +00:00
Chris Hennick
6b797b1ba9
Merge pull request #64 from zip-rs/oldpr368
feat: Added function to get if a file is encrypted or not
2024-07-17 17:25:50 +00:00
Chris Hennick
5632e7f25a
Merge pull request #69 from zip-rs/oldpr369
feat: Add by_name_seek() for Stored zips
2024-07-17 17:25:19 +00:00
Chris Hennick
b8c145717b
Merge pull request #212 from a1phyr/improve_unsafe_code
refactor: Improve `FixedSizeBlock`
2024-07-17 17:24:58 +00:00
Chris Hennick
97d0a345f1
Merge pull request #66 from zip-rs/oldpr373
feat: Add ZipWriter::set_file_metadata()
2024-07-17 17:23:33 +00:00
Benoît du Garreau
2e679997b0 Avoid an intermediary buffer in LZMA decoder 2024-07-16 10:54:52 +02:00
Benoît du Garreau
e9b13121cc Make make_crypto_reader take ZipFileData directly 2024-07-16 10:54:52 +02:00
Benoît du Garreau
deb71baf9b Remove crypto_reader field from ZipFile 2024-07-16 10:54:51 +02:00
Benoît du Garreau
b01d5c9b1f Split reader and decompressor 2024-07-16 10:47:11 +02:00
Chris Hennick
86568263c6
fix: Panic when reading a file truncated in the middle of an XZ block header 2024-07-15 15:07:21 -07:00
Chris Hennick
43db4be237
Update src/write.rs
Signed-off-by: Chris Hennick <4961925+Pr0methean@users.noreply.github.com>
2024-07-15 09:04:41 -07:00
Chris Hennick
bde1bb9ef1
Merge branch 'master' into fix/soft-links-should-remain-the-same 2024-07-15 09:01:34 -07:00
Benoît du Garreau
7a8048b159 Improve FixedSizeBlock
- Remove allocations
- Make unsafe code easier to check
- Prevent potential `repr(Rust)` fields reordering
2024-07-12 11:11:17 +02:00
Benoît du Garreau
83b1273fab Improve several Read methods on ZipFile 2024-07-11 14:31:31 +02:00
nichmor
a3232a2119
Merge branch 'master' into fix/soft-links-should-remain-the-same 2024-07-08 17:15:38 +03:00
Chris Hennick
57f01ba946
chore: Fix build errors 2024-07-06 14:26:37 -07:00
Chris Hennick
8635b16316
Merge branch 'master' into oldpr368 2024-07-06 12:38:27 -07:00
Chris Hennick
4dfa32f666
Merge branch 'master' into oldpr373
Signed-off-by: Chris Hennick <4961925+Pr0methean@users.noreply.github.com>
2024-07-06 12:29:01 -07:00
Chris Hennick
1d551ff23c
Merge branch 'master' into oldpr369
Signed-off-by: Chris Hennick <4961925+Pr0methean@users.noreply.github.com>
2024-07-06 12:19:59 -07:00
Chris Hennick
d45bdccc6a
Merge pull request #199 from yujincheng08/xz
feat: Support XZ decompression
2024-07-05 15:52:38 +00:00
Danny McClanahan
59630c00c6
fix dead code analysis warning 2024-06-27 15:37:45 -04:00
nichmor
955ea393ee fix: read entire target and dont transform symlink to absoulte 2024-06-26 16:24:04 +03:00
Chris Hennick
cbd8aedea4
Merge branch 'master' into xz 2024-06-22 17:04:15 -07:00
Chris Hennick
b6e0a0693b
style: Fix cargo fmt and clippy warnings 2024-06-22 17:03:37 -07:00
Chris Hennick
0807029a63
Merge branch 'master' into xz 2024-06-22 16:58:54 -07:00
Chris Hennick
b051ca3d47
chore: Fix a bug introduced by c934c824 2024-06-22 16:57:49 -07:00
Chris Hennick
5b749c4ed9
Merge branch 'master' into xz 2024-06-21 23:15:47 -07:00
Chris Hennick
fcc4fa93e3
style: Fix a Clippy warning re unnecessary into_iter() 2024-06-21 23:15:23 -07:00
Chris Hennick
c0ede17cd0
Merge branch 'master' into xz 2024-06-21 20:29:30 -07:00
Chris Hennick
e20fd7959a
style: cargo fmt --all 2024-06-21 20:28:43 -07:00
Chris Hennick
9a2391358c
Merge branch 'master' into xz 2024-06-21 20:26:07 -07:00
Chris Hennick
c934c82405
fix: Some archives with over u16::MAX files were handled incorrectly or slowly (#189) 2024-06-21 20:22:15 -07:00