Wordpress/tests/data/formatting/utf-8
2013-08-07 06:38:38 +00:00
..
entitize.py Initial import of unit-tests repository into develop.svn.wordpress.org. 2013-08-07 06:38:38 +00:00
entitized.txt Initial import of unit-tests repository into develop.svn.wordpress.org. 2013-08-07 06:38:38 +00:00
README Initial import of unit-tests repository into develop.svn.wordpress.org. 2013-08-07 06:38:38 +00:00
u-urlencode.py Initial import of unit-tests repository into develop.svn.wordpress.org. 2013-08-07 06:38:38 +00:00
u-urlencoded.txt Initial import of unit-tests repository into develop.svn.wordpress.org. 2013-08-07 06:38:38 +00:00
urlencode.py Initial import of unit-tests repository into develop.svn.wordpress.org. 2013-08-07 06:38:38 +00:00
urlencoded.txt Initial import of unit-tests repository into develop.svn.wordpress.org. 2013-08-07 06:38:38 +00:00
utf-8.txt Initial import of unit-tests repository into develop.svn.wordpress.org. 2013-08-07 06:38:38 +00:00

The Python scripts are for generating test data, because Python's Unicode
support is much, much, much, much better than PHP's.

 * `utf-8/urlencode.py`, `utf-8/u-urlencode.py` and `utf-8/entitize.py` process UTF-8
   into a few different formats (%-encoding, %u-encoding, &#decimal;)
   and are used like normal UNIXy pipes.

   Try:

   `python urlencode.py < utf-8.txt > urlencoded.txt`
   `python u-urlencode.py < utf-8.txt > u-urlencoded.txt`
   `python entitize.py < utf-8.txt > entitized.txt`

  * `windows-1252.py` converts Windows-only smart-quotes and things
    into their unicode &#decimal reference; equivalents.