Update dependency charset-normalizer to v3.3.2
This MR contains the following updates:
Package | Update | Change |
---|---|---|
charset-normalizer | minor |
==3.1.0 -> ==3.3.2
|
⚠ Dependency Lookup Warnings ⚠
Warnings were logged while processing this repo. Please check the logs for more information.
Release Notes
Ousret/charset_normalizer
v3.3.2
Fixed
- Unintentional memory usage regression when using large payload that match several encoding (#376)
- Regression on some detection case showcased in the documentation (#371)
Added
- Noise (md) probe that identify malformed arabic representation due to the presence of letters in isolated form (credit to my wife)
v3.3.1
Changed
- Optional mypyc compilation upgraded to version 1.6.1 for Python >= 3.8
- Improved the general detection reliability based on reports from the community
v3.3.0
Added
- Allow to execute the CLI (e.g. normalizer) through
python -m charset_normalizer.cli
orpython -m charset_normalizer
- Support for 9 forgotten encoding that are supported by Python but unlisted in
encoding.aliases
as they have no alias (#323)
Removed
- (internal) Redundant utils.is_ascii function and unused function is_private_use_only
- (internal) charset_normalizer.assets is moved inside charset_normalizer.constant
Changed
- (internal) Unicode code blocks in constants are updated using the latest v15.0.0 definition to improve detection
- Optional mypyc compilation upgraded to version 1.5.1 for Python >= 3.8
Fixed
- Unable to properly sort CharsetMatch when both chaos/noise and coherence were close due to an unreachable condition in __lt__ (#350)
v3.2.0
Changed
- Typehint for function
from_path
no longer enforcePathLike
as its first argument - Minor improvement over the global detection reliability
Added
- Introduce function
is_binary
that relies on main capabilities, and optimized to detect binaries - Propagate
enable_fallback
argument throughoutfrom_bytes
,from_path
, andfrom_fp
that allow a deeper control over the detection (default True) - Explicit support for Python 3.12
Fixed
- Edge case detection failure where a file would contain 'very-long' camel cased word (Issue #289)
Configuration
-
If you want to rebase/retry this MR, check this box
This MR has been generated by Renovate Bot.