AnonSec Shell
Server IP : 209.38.156.173  /  Your IP : 216.73.216.122   [ Reverse IP ]
Web Server : Apache/2.4.52 (Ubuntu)
System : Linux lakekumayuhotel 5.15.0-136-generic #147-Ubuntu SMP Sat Mar 15 15:53:30 UTC 2025 x86_64
User : root ( 0)
PHP Version : 8.1.2-1ubuntu2.22
Disable Function : NONE
Domains : 2 Domains
MySQL : OFF  |  cURL : ON  |  WGET : ON  |  Perl : ON  |  Python : OFF  |  Sudo : ON  |  Pkexec : ON
Directory :  /lib/python3/dist-packages/chardet/__pycache__/

Upload File :
current_dir [ Writeable ] document_root [ Writeable ]

 

Command :


[ HOME ]     [ BACKUP SHELL ]     [ JUMPING ]     [ MASS DEFACE ]     [ SCAN ROOT ]     [ SYMLINK ]     

Current File : /lib/python3/dist-packages/chardet/__pycache__/charsetprober.cpython-310.pyc
o

-��_��@s0ddlZddlZddlmZGdd�de�ZdS)�N�)�ProbingStatec@sneZdZdZddd�Zdd�Zedd��Zd	d
�Zedd��Z	d
d�Z
edd��Zedd��Z
edd��ZdS)�
CharSetProbergffffff�?NcCsd|_||_t�t�|_dS�N)�_state�lang_filter�logging�	getLogger�__name__�logger)�selfr�r
�7/usr/lib/python3/dist-packages/chardet/charsetprober.py�__init__'szCharSetProber.__init__cCstj|_dSr)r�	DETECTINGr�rr
r
r�reset,szCharSetProber.resetcC�dSrr
rr
r
r�charset_name/szCharSetProber.charset_namecCrrr
)r�bufr
r
r�feed3�zCharSetProber.feedcCs|jSr)rrr
r
r�state6szCharSetProber.statecCsdS)Ngr
rr
r
r�get_confidence:rzCharSetProber.get_confidencecCst�dd|�}|S)Ns([-])+� )�re�sub)rr
r
r�filter_high_byte_only=sz#CharSetProber.filter_high_byte_onlycCs\t�}t�d|�}|D] }|�|dd��|dd�}|��s&|dkr&d}|�|�q|S)u9
        We define three types of bytes:
        alphabet: english alphabets [a-zA-Z]
        international: international characters [€-ÿ]
        marker: everything else [^a-zA-Z€-ÿ]

        The input buffer can be thought to contain a series of words delimited
        by markers. This function works to filter all words that contain at
        least one international character. All contiguous sequences of markers
        are replaced by a single space ascii character.

        This filter applies to all scripts which do not use English characters.
        s%[a-zA-Z]*[�-�]+[a-zA-Z]*[^a-zA-Z�-�]?N�����r)�	bytearrayr�findall�extend�isalpha)r�filtered�words�word�	last_charr
r
r�filter_international_wordsBs�z(CharSetProber.filter_international_wordscCs�t�}d}d}tt|��D]7}|||d�}|dkrd}n|dkr$d}|dkrD|��sD||kr@|s@|�|||��|�d�|d}q
|sP|�||d	��|S)
a�
        Returns a copy of ``buf`` that retains only the sequences of English
        alphabet and high byte characters that are not between <> characters.
        Also retains English alphabet and high byte characters immediately
        before occurrences of >.

        This filter can be applied to all scripts which contain both English
        characters and extended ASCII characters, but is currently only used by
        ``Latin1Prober``.
        Frr�>�<TrrN)r �range�lenr#r")rr$�in_tag�prev�curr�buf_charr
r
r�filter_with_english_lettersgs$
�z)CharSetProber.filter_with_english_lettersr)r
�
__module__�__qualname__�SHORTCUT_THRESHOLDrr�propertyrrrr�staticmethodrr(r1r
r
r
rr#s 




$r)rr�enumsr�objectrr
r
r
r�<module>s

Anon7 - 2022
AnonSec Team