Skip to content

Commit

Permalink
[Tech] Bump pypdf from 3.17.4 to 4.2.0 in /datascience (#3250)
Browse files Browse the repository at this point in the history
Bumps [pypdf](https://github.com/py-pdf/pypdf) from 3.17.4 to 4.2.0.
<details>
<summary>Release notes</summary>
<p><em>Sourced from <a
href="https://github.com/py-pdf/pypdf/releases">pypdf's
releases</a>.</em></p>
<blockquote>
<h2>Version 4.2.0, 2024-04-07</h2>
<h2>What's new</h2>
<h3>New Features (ENH)</h3>
<ul>
<li>Allow multiple charsets for NameObject.read_from_stream (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2585">#2585</a>)
by <a
href="https://github.com/pubpub-zz"><code>@​pubpub-zz</code></a></li>
<li>Add support for /Kids in page labels (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2562">#2562</a>)
by <a
href="https://github.com/stefan6419846"><code>@​stefan6419846</code></a></li>
<li>Allow to update fields on many pages (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2571">#2571</a>)
by <a
href="https://github.com/pubpub-zz"><code>@​pubpub-zz</code></a></li>
<li>Tolerate PDF with invalid xref pointed objects (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2335">#2335</a>)
by <a
href="https://github.com/pubpub-zz"><code>@​pubpub-zz</code></a></li>
<li>Add Enforce from PDF2.0 in viewer_preferences (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2511">#2511</a>)
by <a
href="https://github.com/pubpub-zz"><code>@​pubpub-zz</code></a></li>
<li>Add += and -= operators to ArrayObject (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2510">#2510</a>)
by <a
href="https://github.com/pubpub-zz"><code>@​pubpub-zz</code></a></li>
</ul>
<h3>Bug Fixes (BUG)</h3>
<ul>
<li>Fix merge_page sometimes generating unknown operator 'QQ' (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2588">#2588</a>)
by <a href="https://github.com/rfotino"><code>@​rfotino</code></a></li>
<li>Fix fields update where annotations are kids of field (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2570">#2570</a>)
by <a
href="https://github.com/pubpub-zz"><code>@​pubpub-zz</code></a></li>
<li>Process CMYK images without a filter correctly (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2557">#2557</a>)
by <a
href="https://github.com/pubpub-zz"><code>@​pubpub-zz</code></a></li>
<li>Extract text in layout mode without finding resources (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2555">#2555</a>)
by <a
href="https://github.com/pubpub-zz"><code>@​pubpub-zz</code></a></li>
<li>Prevent recursive loop in some PDF files (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2505">#2505</a>)
by <a
href="https://github.com/pubpub-zz"><code>@​pubpub-zz</code></a></li>
</ul>
<h3>Robustness (ROB)</h3>
<ul>
<li>Tolerate &quot;truncated&quot; xref (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2580">#2580</a>)
by <a
href="https://github.com/pubpub-zz"><code>@​pubpub-zz</code></a></li>
<li>Replace error by warning for EOD in RunLengthDecode/ASCIIHexDecode
(<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2334">#2334</a>)
by <a
href="https://github.com/pubpub-zz"><code>@​pubpub-zz</code></a></li>
<li>Rebuild xref table if one entry is invalid (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2528">#2528</a>)
by <a
href="https://github.com/pubpub-zz"><code>@​pubpub-zz</code></a></li>
<li>Robustify stream extraction (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2526">#2526</a>)
by <a
href="https://github.com/pubpub-zz"><code>@​pubpub-zz</code></a></li>
</ul>
<h3>Documentation (DOC)</h3>
<ul>
<li>Update release process for latest changes (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2564">#2564</a>)
by <a
href="https://github.com/stefan6419846"><code>@​stefan6419846</code></a></li>
<li>Encryption/decryption: Clone document instead of copying all pages
(<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2546">#2546</a>)
by <a
href="https://github.com/redfast00"><code>@​redfast00</code></a></li>
<li>Minor improvements (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2542">#2542</a>)
by <a href="https://github.com/j-t-1"><code>@​j-t-1</code></a></li>
<li>Update annotation list (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2534">#2534</a>)
by <a href="https://github.com/j-t-1"><code>@​j-t-1</code></a></li>
<li>Update references and formatting (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2529">#2529</a>)
by <a href="https://github.com/j-t-1"><code>@​j-t-1</code></a></li>
<li>Correct threads reference, plus minor changes (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2521">#2521</a>)
by <a href="https://github.com/j-t-1"><code>@​j-t-1</code></a></li>
<li>Minor readability increases (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2515">#2515</a>)
by <a href="https://github.com/j-t-1"><code>@​j-t-1</code></a></li>
<li>Simplify PaperSize examples (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2504">#2504</a>)
by <a href="https://github.com/j-t-1"><code>@​j-t-1</code></a></li>
<li>Minor improvements (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2501">#2501</a>)
by <a href="https://github.com/j-t-1"><code>@​j-t-1</code></a></li>
</ul>
<h3>Developer Experience (DEV)</h3>
<ul>
<li>Remove unused dependencies (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2572">#2572</a>)
by <a
href="https://github.com/stefan6419846"><code>@​stefan6419846</code></a></li>
<li>Remove page labels PR link from message (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2561">#2561</a>)
by <a
href="https://github.com/stefan6419846"><code>@​stefan6419846</code></a></li>
<li>Fix changelog generator regarding whitespace and handling of
&quot;Other&quot; group (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2492">#2492</a>)
by <a
href="https://github.com/stefan6419846"><code>@​stefan6419846</code></a></li>
<li>Add REL to known PR prefixes (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2554">#2554</a>)
by <a
href="https://github.com/stefan6419846"><code>@​stefan6419846</code></a></li>
<li>Release using the REL commit instead of git tag (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2500">#2500</a>)
by <a
href="https://github.com/MartinThoma"><code>@​MartinThoma</code></a></li>
<li>Unify code between PdfReader and PdfWriter (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2497">#2497</a>)
by <a
href="https://github.com/pubpub-zz"><code>@​pubpub-zz</code></a></li>
<li>Bump softprops/action-gh-release from 1 to 2 (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2514">#2514</a>)
by <a
href="https://github.com/dependabot"><code>@​dependabot</code></a>[bot]</li>
</ul>
<h3>Maintenance (MAINT)</h3>
<ul>
<li>Ressources → Resources (and internal name childs) (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2550">#2550</a>)
by <a
href="https://github.com/pubpub-zz"><code>@​pubpub-zz</code></a></li>
<li>Fix typos found by codespell (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2549">#2549</a>)
by <a
href="https://github.com/stefan6419846"><code>@​stefan6419846</code></a></li>
<li>Update Read the Docs configuration (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2538">#2538</a>)
by <a href="https://github.com/j-t-1"><code>@​j-t-1</code></a></li>
<li>Add root_object, _info and _ID to PdfReader (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2495">#2495</a>)
by <a
href="https://github.com/pubpub-zz"><code>@​pubpub-zz</code></a></li>
</ul>
<!-- raw HTML omitted -->
</blockquote>
<p>... (truncated)</p>
</details>
<details>
<summary>Changelog</summary>
<p><em>Sourced from <a
href="https://github.com/py-pdf/pypdf/blob/main/CHANGELOG.md">pypdf's
changelog</a>.</em></p>
<blockquote>
<h2>Version 4.2.0, 2024-04-07</h2>
<h3>New Features (ENH)</h3>
<ul>
<li>Allow multiple charsets for NameObject.read_from_stream (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2585">#2585</a>)</li>
<li>Add support for /Kids in page labels (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2562">#2562</a>)</li>
<li>Allow to update fields on many pages (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2571">#2571</a>)</li>
<li>Tolerate PDF with invalid xref pointed objects (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2335">#2335</a>)</li>
<li>Add Enforce from PDF2.0 in viewer_preferences (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2511">#2511</a>)</li>
<li>Add += and -= operators to ArrayObject (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2510">#2510</a>)</li>
</ul>
<h3>Bug Fixes (BUG)</h3>
<ul>
<li>Fix merge_page sometimes generating unknown operator 'QQ' (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2588">#2588</a>)</li>
<li>Fix fields update where annotations are kids of field (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2570">#2570</a>)</li>
<li>Process CMYK images without a filter correctly (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2557">#2557</a>)</li>
<li>Extract text in layout mode without finding resources (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2555">#2555</a>)</li>
<li>Prevent recursive loop in some PDF files (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2505">#2505</a>)</li>
</ul>
<h3>Robustness (ROB)</h3>
<ul>
<li>Tolerate &quot;truncated&quot; xref (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2580">#2580</a>)</li>
<li>Replace error by warning for EOD in RunLengthDecode/ASCIIHexDecode
(<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2334">#2334</a>)</li>
<li>Rebuild xref table if one entry is invalid (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2528">#2528</a>)</li>
<li>Robustify stream extraction (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2526">#2526</a>)</li>
</ul>
<h3>Documentation (DOC)</h3>
<ul>
<li>Update release process for latest changes (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2564">#2564</a>)</li>
<li>Encryption/decryption: Clone document instead of copying all pages
(<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2546">#2546</a>)</li>
<li>Minor improvements (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2542">#2542</a>)</li>
<li>Update annotation list (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2534">#2534</a>)</li>
<li>Update references and formatting (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2529">#2529</a>)</li>
<li>Correct threads reference, plus minor changes (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2521">#2521</a>)</li>
<li>Minor readability increases (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2515">#2515</a>)</li>
<li>Simplify PaperSize examples (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2504">#2504</a>)</li>
<li>Minor improvements (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2501">#2501</a>)</li>
</ul>
<h3>Developer Experience (DEV)</h3>
<ul>
<li>Remove unused dependencies (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2572">#2572</a>)</li>
<li>Remove page labels PR link from message (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2561">#2561</a>)</li>
<li>Fix changelog generator regarding whitespace and handling of
&quot;Other&quot; group (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2492">#2492</a>)</li>
<li>Add REL to known PR prefixes (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2554">#2554</a>)</li>
<li>Release using the REL commit instead of git tag (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2500">#2500</a>)</li>
<li>Unify code between PdfReader and PdfWriter (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2497">#2497</a>)</li>
<li>Bump softprops/action-gh-release from 1 to 2 (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2514">#2514</a>)</li>
</ul>
<h3>Maintenance (MAINT)</h3>
<ul>
<li>Ressources → Resources (and internal name childs) (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2550">#2550</a>)</li>
<li>Fix typos found by codespell (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2549">#2549</a>)</li>
<li>Update Read the Docs configuration (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2538">#2538</a>)</li>
<li>Add root_object, _info and _ID to PdfReader (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2495">#2495</a>)</li>
</ul>
<h3>Testing (TST)</h3>
<!-- raw HTML omitted -->
</blockquote>
<p>... (truncated)</p>
</details>
<details>
<summary>Commits</summary>
<ul>
<li><a
href="https://github.com/py-pdf/pypdf/commit/2ac88e635c8e16478fc11bde02411cfa573a7807"><code>2ac88e6</code></a>
REL: 4.2.0</li>
<li><a
href="https://github.com/py-pdf/pypdf/commit/561b1b07e149fdd45f78f5263ea7ca1a01b9bd06"><code>561b1b0</code></a>
BUG: Fix merge_page sometimes generating unknown operator 'QQ' (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2588">#2588</a>)</li>
<li><a
href="https://github.com/py-pdf/pypdf/commit/ae0d27b72a10f9239d2eed93420980af0bb21ba9"><code>ae0d27b</code></a>
TST: Allow loading truncated images if required (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2586">#2586</a>)</li>
<li><a
href="https://github.com/py-pdf/pypdf/commit/956fd030c7455dab360b5cb5ce997a19ee7290f4"><code>956fd03</code></a>
ENH: Allow multiple charsets for NameObject.read_from_stream (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2585">#2585</a>)</li>
<li><a
href="https://github.com/py-pdf/pypdf/commit/0f7c8fe4face5e5cd3f4620a8cbd0e44b888698e"><code>0f7c8fe</code></a>
ROB: Tolerate &quot;truncated&quot; xref (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2580">#2580</a>)</li>
<li><a
href="https://github.com/py-pdf/pypdf/commit/6152893d6772b1ba6e08e2a9747d10419103de22"><code>6152893</code></a>
TST: Fix download issues from <a
href="https://redirect.github.com/py-pdf/pypdf/issues/2562">#2562</a>
(<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2578">#2578</a>)</li>
<li><a
href="https://github.com/py-pdf/pypdf/commit/bc299012f5253f04c3d990c8d1b3eec97c7d2006"><code>bc29901</code></a>
ENH: Add support for /Kids in page labels (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2562">#2562</a>)</li>
<li><a
href="https://github.com/py-pdf/pypdf/commit/4bdca1628606706f652f02c260c2e7dd53fb2ff7"><code>4bdca16</code></a>
ENH: Allow to update fields on many pages (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2571">#2571</a>)</li>
<li><a
href="https://github.com/py-pdf/pypdf/commit/4d3d797b40a572e0f30408614647b2269bca5aba"><code>4d3d797</code></a>
DEV: Remove unused dependencies (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2572">#2572</a>)</li>
<li><a
href="https://github.com/py-pdf/pypdf/commit/50bb7f26a3574bb92448c2b64d0d125435f27638"><code>50bb7f2</code></a>
BUG: Fix fields update where annotations are kids of field (<a
href="https://redirect.github.com/py-pdf/pypdf/issues/2570">#2570</a>)</li>
<li>Additional commits viewable in <a
href="https://github.com/py-pdf/pypdf/compare/3.17.4...4.2.0">compare
view</a></li>
</ul>
</details>
<br />


[![Dependabot compatibility
score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=pypdf&package-manager=pip&previous-version=3.17.4&new-version=4.2.0)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores)

Dependabot will resolve any conflicts with this PR as long as you don't
alter it yourself. You can also trigger a rebase manually by commenting
`@dependabot rebase`.

[//]: # (dependabot-automerge-start)
[//]: # (dependabot-automerge-end)

---

<details>
<summary>Dependabot commands and options</summary>
<br />

You can trigger Dependabot actions by commenting on this PR:
- `@dependabot rebase` will rebase this PR
- `@dependabot recreate` will recreate this PR, overwriting any edits
that have been made to it
- `@dependabot merge` will merge this PR after your CI passes on it
- `@dependabot squash and merge` will squash and merge this PR after
your CI passes on it
- `@dependabot cancel merge` will cancel a previously requested merge
and block automerging
- `@dependabot reopen` will reopen this PR if it is closed
- `@dependabot close` will close this PR and stop Dependabot recreating
it. You can achieve the same result by closing it manually
- `@dependabot show <dependency name> ignore conditions` will show all
of the ignore conditions of the specified dependency
- `@dependabot ignore this major version` will close this PR and stop
Dependabot creating any more for this major version (unless you reopen
the PR or upgrade to it yourself)
- `@dependabot ignore this minor version` will close this PR and stop
Dependabot creating any more for this minor version (unless you reopen
the PR or upgrade to it yourself)
- `@dependabot ignore this dependency` will close this PR and stop
Dependabot creating any more for this dependency (unless you reopen the
PR or upgrade to it yourself)


</details>
  • Loading branch information
VincentAntoine committed Jun 25, 2024
2 parents 3b78299 + 86b71d8 commit e3068fe
Show file tree
Hide file tree
Showing 2 changed files with 5 additions and 5 deletions.
8 changes: 4 additions & 4 deletions datascience/poetry.lock

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

2 changes: 1 addition & 1 deletion datascience/pyproject.toml
Original file line number Diff line number Diff line change
Expand Up @@ -31,7 +31,7 @@ prefect = "^1.3"
requests-mock = "^1.11.0"
duckdb = "^0.10.1"
h3 = "^3.7.6"
pypdf = "^3.16.4"
pypdf = "^4.2.0"


[tool.poetry.group.dev.dependencies]
Expand Down

0 comments on commit e3068fe

Please sign in to comment.