camel_tools.utils.dediac

This submodule contains functions for dediacritizing Arabic text in different encodings. See Encoding Schemes for more information on encodings.

Functions

camel_tools.utils.dediac.dediac_ar(s)

Dediacritize Unicode Arabic string.

Parameters: s (str) – String to dediacritize.
Returns: Dediacritized string.
Return type: str
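
To make the behaviour concrete, here is a minimal sketch of Unicode dediacritization using only the standard library. It is not the camel_tools implementation. The helper name strip_ar_diacritics and the character range are assumptions chosen for illustration, and the library's own diacritic set may differ.

```python
import re

# Assumed diacritic set for illustration only: fathatan through sukun
# (U+064B-U+0652) plus superscript alef (U+0670). The set used by
# camel_tools itself may include other characters.
_AR_DIACRITICS = re.compile('[\u064b-\u0652\u0670]')


def strip_ar_diacritics(s):
    """Hypothetical stand-in for dediac_ar: remove Arabic diacritic marks."""
    return _AR_DIACRITICS.sub('', s)


# kaf + fatha, teh + fatha, beh + fatha
diacritized = '\u0643\u064e\u062a\u064e\u0628\u064e'
print(strip_ar_diacritics(diacritized))  # prints the three bare letters: كتب
```

With camel_tools installed, the corresponding call would be dediac_ar(diacritized), with dediac_ar imported from camel_tools.utils.dediac.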

camel_tools.utils.dediac.dediac_bw(s)

Dediacritize Buckwalter encoded string.

Parameters: s (str) – String to dediacritize.
Returns: Dediacritized string.
Return type: str
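
The same idea applies to Buckwalter-encoded text, where diacritics are ordinary ASCII symbols. The sketch below is an illustration under stated assumptions, not the library's code. strip_bw_diacritics is a hypothetical helper, and the symbol list reflects the commonly cited Buckwalter diacritic symbols, which may not match the set camel_tools removes.

```python
# Assumed Buckwalter diacritic symbols (illustrative; may differ from the
# library's set): a/u/i short vowels, F/N/K tanween, ~ shadda, o sukun,
# ` dagger alef.
_BW_DIACRITICS = 'auiFNK~o`'

# Translation table that deletes every character listed above.
_DELETE_TABLE = str.maketrans('', '', _BW_DIACRITICS)


def strip_bw_diacritics(s):
    """Hypothetical stand-in for dediac_bw: remove Buckwalter diacritics."""
    return s.translate(_DELETE_TABLE)


print(strip_bw_diacritics('kataba'))     # ktb
print(strip_bw_diacritics('mudar~isN'))  # mdrs
```

Because each encoding represents diacritics with different symbols, the input has to be passed to the function that matches its encoding. The Buckwalter variants below follow the same pattern with their own symbol sets.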

camel_tools.utils.dediac.dediac_safebw(s)

Dediacritize Safe Buckwalter encoded string.

Parameters: s (str) – String to dediacritize.
Returns: Dediacritized string.
Return type: str

camel_tools.utils.dediac.dediac_xmlbw(s)

Dediacritize XML Buckwalter encoded string.

Parameters: s (str) – String to dediacritize.
Returns: Dediacritized string.
Return type: str

camel_tools.utils.dediac.dediac_hsb(s)

Dediacritize Habash-Soudi-Buckwalter encoded string.

Parameters: s (str) – String to dediacritize.
Returns: Dediacritized string.
Return type: str