Function to Convert Decomposed (Tổ Hợp) Unicode Strings to ... - Gists · GitHub

anhtran/unicode_utils.py, created April 18, 2018 05:41

Function to convert decomposed (tổ hợp) Unicode strings to precomposed (dựng sẵn) Unicode in Python 3

unicode_utils.py

def compound_unicode(unicode_str):
    """
    Convert a decomposed (tổ hợp) Unicode string to precomposed (dựng sẵn) Unicode.
    Edited from: `https://gist.github.com/redphx/9320735`
    """
    unicode_str = unicode_str.replace("\u0065\u0309", "\u1EBB") # ẻ
    unicode_str = unicode_str.replace("\u0065\u0301", "\u00E9") # é
    unicode_str = unicode_str.replace("\u0065\u0300", "\u00E8") # è
    unicode_str = unicode_str.replace("\u0065\u0323", "\u1EB9") # ẹ
    unicode_str = unicode_str.replace("\u0065\u0303", "\u1EBD") # ẽ
    unicode_str = unicode_str.replace("\u00EA\u0309", "\u1EC3") # ể
    unicode_str = unicode_str.replace("\u00EA\u0301", "\u1EBF") # ế
    unicode_str = unicode_str.replace("\u00EA\u0300", "\u1EC1") # ề
    unicode_str = unicode_str.replace("\u00EA\u0323", "\u1EC7") # ệ
    unicode_str = unicode_str.replace("\u00EA\u0303", "\u1EC5") # ễ
    unicode_str = unicode_str.replace("\u0079\u0309", "\u1EF7") # ỷ
    unicode_str = unicode_str.replace("\u0079\u0301", "\u00FD") # ý
    unicode_str = unicode_str.replace("\u0079\u0300", "\u1EF3") # ỳ
    unicode_str = unicode_str.replace("\u0079\u0323", "\u1EF5") # ỵ
    unicode_str = unicode_str.replace("\u0079\u0303", "\u1EF9") # ỹ
    unicode_str = unicode_str.replace("\u0075\u0309", "\u1EE7") # ủ
    unicode_str = unicode_str.replace("\u0075\u0301", "\u00FA") # ú
    unicode_str = unicode_str.replace("\u0075\u0300", "\u00F9") # ù
    unicode_str = unicode_str.replace("\u0075\u0323", "\u1EE5") # ụ
    unicode_str = unicode_str.replace("\u0075\u0303", "\u0169") # ũ
    unicode_str = unicode_str.replace("\u01B0\u0309", "\u1EED") # ử
    unicode_str = unicode_str.replace("\u01B0\u0301", "\u1EE9") # ứ
    unicode_str = unicode_str.replace("\u01B0\u0300", "\u1EEB") # ừ
    unicode_str = unicode_str.replace("\u01B0\u0323", "\u1EF1") # ự
    unicode_str = unicode_str.replace("\u01B0\u0303", "\u1EEF") # ữ
    unicode_str = unicode_str.replace("\u0069\u0309", "\u1EC9") # ỉ
    unicode_str = unicode_str.replace("\u0069\u0301", "\u00ED") # í
    unicode_str = unicode_str.replace("\u0069\u0300", "\u00EC") # ì
    unicode_str = unicode_str.replace("\u0069\u0323", "\u1ECB") # ị
    unicode_str = unicode_str.replace("\u0069\u0303", "\u0129") # ĩ
    unicode_str = unicode_str.replace("\u006F\u0309", "\u1ECF") # ỏ
    unicode_str = unicode_str.replace("\u006F\u0301", "\u00F3") # ó
    unicode_str = unicode_str.replace("\u006F\u0300", "\u00F2") # ò
    unicode_str = unicode_str.replace("\u006F\u0323", "\u1ECD") # ọ
    unicode_str = unicode_str.replace("\u006F\u0303", "\u00F5") # õ
    unicode_str = unicode_str.replace("\u01A1\u0309", "\u1EDF") # ở
    unicode_str = unicode_str.replace("\u01A1\u0301", "\u1EDB") # ớ
    unicode_str = unicode_str.replace("\u01A1\u0300", "\u1EDD") # ờ
    unicode_str = unicode_str.replace("\u01A1\u0323", "\u1EE3") # ợ
    unicode_str = unicode_str.replace("\u01A1\u0303", "\u1EE1") # ỡ
    unicode_str = unicode_str.replace("\u00F4\u0309", "\u1ED5") # ổ
    unicode_str = unicode_str.replace("\u00F4\u0301", "\u1ED1") # ố
    unicode_str = unicode_str.replace("\u00F4\u0300", "\u1ED3") # ồ
    unicode_str = unicode_str.replace("\u00F4\u0323", "\u1ED9") # ộ
    unicode_str = unicode_str.replace("\u00F4\u0303", "\u1ED7") # ỗ
    unicode_str = unicode_str.replace("\u0061\u0309", "\u1EA3") # ả
    unicode_str = unicode_str.replace("\u0061\u0301", "\u00E1") # á
    unicode_str = unicode_str.replace("\u0061\u0300", "\u00E0") # à
    unicode_str = unicode_str.replace("\u0061\u0323", "\u1EA1") # ạ
    unicode_str = unicode_str.replace("\u0061\u0303", "\u00E3") # ã
    unicode_str = unicode_str.replace("\u0103\u0309", "\u1EB3") # ẳ
    unicode_str = unicode_str.replace("\u0103\u0301", "\u1EAF") # ắ
    unicode_str = unicode_str.replace("\u0103\u0300", "\u1EB1") # ằ
    unicode_str = unicode_str.replace("\u0103\u0323", "\u1EB7") # ặ
    unicode_str = unicode_str.replace("\u0103\u0303", "\u1EB5") # ẵ
    unicode_str = unicode_str.replace("\u00E2\u0309", "\u1EA9") # ẩ
    unicode_str = unicode_str.replace("\u00E2\u0301", "\u1EA5") # ấ
    unicode_str = unicode_str.replace("\u00E2\u0300", "\u1EA7") # ầ
    unicode_str = unicode_str.replace("\u00E2\u0323", "\u1EAD") # ậ
    unicode_str = unicode_str.replace("\u00E2\u0303", "\u1EAB") # ẫ
    unicode_str = unicode_str.replace("\u0045\u0309", "\u1EBA") # Ẻ
    unicode_str = unicode_str.replace("\u0045\u0301", "\u00C9") # É
    unicode_str = unicode_str.replace("\u0045\u0300", "\u00C8") # È
    unicode_str = unicode_str.replace("\u0045\u0323", "\u1EB8") # Ẹ
    unicode_str = unicode_str.replace("\u0045\u0303", "\u1EBC") # Ẽ
    unicode_str = unicode_str.replace("\u00CA\u0309", "\u1EC2") # Ể
    unicode_str = unicode_str.replace("\u00CA\u0301", "\u1EBE") # Ế
    unicode_str = unicode_str.replace("\u00CA\u0300", "\u1EC0") # Ề
    unicode_str = unicode_str.replace("\u00CA\u0323", "\u1EC6") # Ệ
    unicode_str = unicode_str.replace("\u00CA\u0303", "\u1EC4") # Ễ
    unicode_str = unicode_str.replace("\u0059\u0309", "\u1EF6") # Ỷ
    unicode_str = unicode_str.replace("\u0059\u0301", "\u00DD") # Ý
    unicode_str = unicode_str.replace("\u0059\u0300", "\u1EF2") # Ỳ
    unicode_str = unicode_str.replace("\u0059\u0323", "\u1EF4") # Ỵ
    unicode_str = unicode_str.replace("\u0059\u0303", "\u1EF8") # Ỹ
    unicode_str = unicode_str.replace("\u0055\u0309", "\u1EE6") # Ủ
    unicode_str = unicode_str.replace("\u0055\u0301", "\u00DA") # Ú
    unicode_str = unicode_str.replace("\u0055\u0300", "\u00D9") # Ù
    unicode_str = unicode_str.replace("\u0055\u0323", "\u1EE4") # Ụ
    unicode_str = unicode_str.replace("\u0055\u0303", "\u0168") # Ũ
    unicode_str = unicode_str.replace("\u01AF\u0309", "\u1EEC") # Ử
    unicode_str = unicode_str.replace("\u01AF\u0301", "\u1EE8") # Ứ
    unicode_str = unicode_str.replace("\u01AF\u0300", "\u1EEA") # Ừ
    unicode_str = unicode_str.replace("\u01AF\u0323", "\u1EF0") # Ự
    unicode_str = unicode_str.replace("\u01AF\u0303", "\u1EEE") # Ữ
    unicode_str = unicode_str.replace("\u0049\u0309", "\u1EC8") # Ỉ
    unicode_str = unicode_str.replace("\u0049\u0301", "\u00CD") # Í
    unicode_str = unicode_str.replace("\u0049\u0300", "\u00CC") # Ì
    unicode_str = unicode_str.replace("\u0049\u0323", "\u1ECA") # Ị
    unicode_str = unicode_str.replace("\u0049\u0303", "\u0128") # Ĩ
    unicode_str = unicode_str.replace("\u004F\u0309", "\u1ECE") # Ỏ
    unicode_str = unicode_str.replace("\u004F\u0301", "\u00D3") # Ó
    unicode_str = unicode_str.replace("\u004F\u0300", "\u00D2") # Ò
    unicode_str = unicode_str.replace("\u004F\u0323", "\u1ECC") # Ọ
    unicode_str = unicode_str.replace("\u004F\u0303", "\u00D5") # Õ
    unicode_str = unicode_str.replace("\u01A0\u0309", "\u1EDE") # Ở
    unicode_str = unicode_str.replace("\u01A0\u0301", "\u1EDA") # Ớ
    unicode_str = unicode_str.replace("\u01A0\u0300", "\u1EDC") # Ờ
    unicode_str = unicode_str.replace("\u01A0\u0323", "\u1EE2") # Ợ
    unicode_str = unicode_str.replace("\u01A0\u0303", "\u1EE0") # Ỡ
    unicode_str = unicode_str.replace("\u00D4\u0309", "\u1ED4") # Ổ
    unicode_str = unicode_str.replace("\u00D4\u0301", "\u1ED0") # Ố
    unicode_str = unicode_str.replace("\u00D4\u0300", "\u1ED2") # Ồ
    unicode_str = unicode_str.replace("\u00D4\u0323", "\u1ED8") # Ộ
    unicode_str = unicode_str.replace("\u00D4\u0303", "\u1ED6") # Ỗ
    unicode_str = unicode_str.replace("\u0041\u0309", "\u1EA2") # Ả
    unicode_str = unicode_str.replace("\u0041\u0301", "\u00C1") # Á
    unicode_str = unicode_str.replace("\u0041\u0300", "\u00C0") # À
    unicode_str = unicode_str.replace("\u0041\u0323", "\u1EA0") # Ạ
    unicode_str = unicode_str.replace("\u0041\u0303", "\u00C3") # Ã
    unicode_str = unicode_str.replace("\u0102\u0309", "\u1EB2") # Ẳ
    unicode_str = unicode_str.replace("\u0102\u0301", "\u1EAE") # Ắ
    unicode_str = unicode_str.replace("\u0102\u0300", "\u1EB0") # Ằ
    unicode_str = unicode_str.replace("\u0102\u0323", "\u1EB6") # Ặ
    unicode_str = unicode_str.replace("\u0102\u0303", "\u1EB4") # Ẵ
    unicode_str = unicode_str.replace("\u00C2\u0309", "\u1EA8") # Ẩ
    unicode_str = unicode_str.replace("\u00C2\u0301", "\u1EA4") # Ấ
    unicode_str = unicode_str.replace("\u00C2\u0300", "\u1EA6") # Ầ
    unicode_str = unicode_str.replace("\u00C2\u0323", "\u1EAC") # Ậ
    unicode_str = unicode_str.replace("\u00C2\u0303", "\u1EAA") # Ẫ
    return unicode_str
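
For comparison, here is a minimal sketch that uses the standard library's `unicodedata.normalize` with the NFC form, which performs canonical composition in general rather than through a fixed replacement table. The helper name `to_precomposed` and the sample string are hypothetical and not part of the gist. NFC also composes non-Vietnamese sequences, so on mixed text its output may differ from the replacement table above.

```python
import unicodedata


def to_precomposed(text: str) -> str:
    """Hypothetical helper: compose base letters and combining marks via NFC."""
    return unicodedata.normalize("NFC", text)


# "Tiếng Việt" written with combining tone marks after a precomposed ê,
# the same kind of input the replacement table targets.
decomposed = "Ti\u00EA\u0301ng Vi\u00EA\u0323t"
composed = to_precomposed(decomposed)

# Each (letter, tone mark) pair collapses into a single code point.
print(len(decomposed), len(composed))
print(composed == "Ti\u1EBFng Vi\u1EC7t")
```

NFC also handles fully decomposed input such as `e` followed by U+0302 and U+0301, which the pairwise replacements do not cover because they expect the circumflex, breve, or horn to be already attached to the base letter.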

tannghia025 commented Jun 29, 2021

This is really great.

peternguyenexpert commented Sep 6, 2021

Is there a PHP version of this?

thundertd commented Jan 27, 2022

Very nice, thanks a lot.

congnguyenit commented Jun 16, 2022 (edited)

"Is there a PHP version of this?"

This is just string replacement, and you still want someone to write the PHP code for you?

hieuhiti commented Jun 23, 2022

Here you go: https://gist.github.com/hieuhiti/52438445fceb9d3c10caf1d78cce6d1e

thaidoan868 commented May 28, 2024

Really nice. Does anyone here type Vietnamese in decomposed (combining) form? On Linux I type with combining characters for convenience.

htk68 commented Jun 4, 2024

Thank you very much.
