Metadata-Version: 2.1
Name: degrotesque
Version: 0.4
Summary: A tiny web type setter
Home-page: https://github.com/dkrajzew/degrotesque
Author: dkrajzew
Author-email: d.krajzewicz@gmail.com
License: UNKNOWN
Description: degrotesque - a tiny web type setter
        
        The script loads a HTML page - or several in batch, one after the other - and for each, it replaces some commonly used non-typographic characters, such as ", ', -, etc. into their typographic representant for improving the pages' appearance.  
        
        E.g.:
        
         "Well - that's not what I had expected."
        
        will become:
        
         &ldquo;Well &mdash; that&apos;s not what I had expected.&rdquo;
        
        (Uhm, uhm, for those who don't see it, the starting and ending quotes have been replaced by &amp;ldquo; and &amp;rdquo;, respectively, the ' by &amp;amp; and the - by a &amp;mdash;.)
        
        The script has the following options:
        * --input/-i: the file or the folder to process
        * --recursive/-r: Set if the folder - if given - shall be processed recursively
        * --no-backup/-B: Set if no backup files shall be generated
        * --actions/-a: Name the actions that shall be applied
        * --extensions/-e: The extensions of files that shall be processed
        
        All of the text is replaced. This means everything not within a &lt; and a &gt;. But yes, the script is smart enough to skip the contents of the elements "pre", "style", "script", "code", and "<?".
        
        The default actions are: quotes.english, dashes, ellipsis, math, apostroph. The list of all implemented actions is given below, as well as the default extensions of files that will be passed if a folder is given.
        
        There are some caveats, yes:
        * If you embed HTML code in HTML (not suported by HTML, but who cares), it may yield in odd behaviour.
        * If you have php-pages and combine php-generated and plain HTML text, it may yield in odd behaviour. Etc. So you should check your pages for correctness after applying degrotesque.
        
        degrotesque is licensed under the [GPL v3.0](LICENSE.txt).
        
        Well, have fun. If you have any questions or comments, let me know.
        
        Named Actions
        -------------
        
        The following action sets are currently implemented. 
        
        | Action Name | From Opening String | From Closing String | To Opening String | To Closing String |
        | ---- | ---- | ---- | ---- | ---- |
        | quotes.english | " '" | "'" | " &lsquo;" | "&rsquo;" |
        | | "\"" | "\"" | "&ldquo;" | "&rdquo;" |
        | quotes.french | "&lt;" | "&gt;" | "&lsaquo;" | "&rsaquo;" |
        | | "&lt;&lt;" | "&gt;&gt;" | "&laquo;" | "&raquo;" |
        | quotes.german | " '" | "'" | " &sbquo;" | "&rsquo;" |
        | | "\"" | "\"" | "&bdquo;" | "&rdquo;" |
        | to_quotes | " '" | "'" | " &lt;q&gt;" | "&lt;/q&gt;" |
        | | "\"" | "\"" | "&lt;q&gt;" | "&lt;/q&gt;" |
        | | "&lt;&lt;" | "&gt;&gt;" | "&lt;q&gt;" | "&lt;/q&gt;" |
        | commercial | "(c)" | | "&copy;" | |
        | | "(C)" | | "&copy;" | |
        | | "(r)" | | "&reg;" | |
        | | "(R)" | | "&reg;" | |
        | | "(tm)" | | "&trade;" | |
        | | "(TM)" | | "&trade;" | |
        | dashes | " - " | | "&mdash;" | |
        | bullets | "*" | | "&bull;" | |
        | ellipsis | "..." | | "&hellip;" | |
        | apostrophe | "'" | | "&apos;" | |
        | math | "+/-" | | "&plusmn;" | |
        | | "1/2" | | "&frac12;" | |
        | | "1/4" | | "&frac14;" | |
        | | "~" | | "&asymp;" | |
        | | "!=" | | "&ne;" | |
        | | "<=" | | "&le;" | |
        | | ">=" | | "&ge;" | |
        | dagger | "**" | | "&Dagger;" | |
        | | "*" | | "&dagger;" | |
         
        Default Extensions
        ------------------
        
        Files with the following extensions are parsed per default:
        * html, htm, xhtml,
        * php, phtml, phtm, php2, php3, php4, php5,
        * asp, 
        * jsp, jspx, 
        * shtml, shtm, sht, stm,
        * vbhtml,
        * ppthtml,   
        * ssp, jhtml
         
        Notes
        -----
        * I tried [Genshi](https://genshi.edgewall.org/), [BeautifulSoup](https://www.crummy.com/software/BeautifulSoup/), and [lxml](https://lxml.de/). All missed in keeping the code unchanged. So the parser just skips HTML-elements and the contents of some special elements, see above. Works in most cases.
        
        
        
        
Platform: UNKNOWN
Classifier: Development Status :: 3 - Alpha
Classifier: Programming Language :: Python :: 2.7
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: GNU General Public License v3 (GPLv3)
Classifier: Operating System :: OS Independent
Requires-Python: >=2.7, <4
Description-Content-Type: text/markdown
