Annotation of loncom/html/htmlarea/plugins/SpellChecker/readme-tech.html, revision 1.2
1.1 www 1: <!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 3.2//EN">
2: <html>
3: <head>
4: <title>HTMLArea Spell Checker</title>
5: </head>
6:
7: <body>
8: <h1>HTMLArea Spell Checker</h1>
9:
10: <p>The HTMLArea Spell Checker subsystem consists of the following
11: files:</p>
12:
13: <ul>
14:
15: <li>spell-checker.js — the spell checker plugin interface for
16: HTMLArea</li>
17:
18: <li>spell-checker-ui.html — the HTML code for the user
19: interface</li>
20:
21: <li>spell-checker-ui.js — functionality of the user
22: interface</li>
23:
24: <li>spell-checker-logic.cgi — Perl CGI script that checks a text
25: given through POST for spelling errors</li>
26:
27: <li>spell-checker-style.css — style for mispelled words</li>
28:
29: <li>lang/en.js — main language file (English).</li>
30:
31: </ul>
32:
33: <h2>Process overview</h2>
34:
35: <p>
36: When an end-user clicks the "spell-check" button in the HTMLArea
37: editor, a new window is opened with the URL of "spell-check-ui.html".
38: This window initializes itself with the text found in the editor (uses
39: <tt>window.opener.SpellChecker.editor</tt> global variable) and it
40: submits the text to the server-side script "spell-check-logic.cgi".
41: The target of the FORM is an inline frame which is used both to
42: display the text and correcting.
43: </p>
44:
45: <p>
46: Further, spell-check-logic.cgi calls Aspell for each portion of plain
47: text found in the given HTML. It rebuilds an HTML file that contains
48: clear marks of which words are incorrect, along with suggestions for
49: each of them. This file is then loaded in the inline frame. Upon
50: loading, a JavaScript function from "spell-check-ui.js" is called.
51: This function will retrieve all mispelled words from the HTML of the
52: iframe and will setup the user interface so that it allows correction.
53: </p>
54:
55: <h2>The server-side script (spell-check-logic.cgi)</h2>
56:
57: <p>
58: <strong>Unicode safety</strong> — the program <em>is</em>
59: Unicode safe. HTML entities are expanded into their corresponding
60: Unicode characters. These characters will be matched as part of the
61: word passed to Aspell. All texts passed to Aspell are in Unicode
1.2 ! www 62: (when appropriate). <strike>However, Aspell seems to not support Unicode
1.1 www 63: yet (<a
64: href="http://mail.gnu.org/archive/html/aspell-user/2000-11/msg00007.html">thread concerning Aspell and Unicode</a>).
65: This mean that words containing Unicode
1.2 ! www 66: characters that are not in 0..255 are likely to be reported as "mispelled" by Aspell.</strike>
1.1 www 67: </p>
68:
69: <p>
1.2 ! www 70: <strong style="font-variant: small-caps; color:
! 71: red;">Update:</strong> though I've never seen it mentioned
! 72: anywhere, it looks that Aspell <em>does</em>, in fact, speak
! 73: Unicode. Or else, maybe <code>Text::Aspell</code> does
! 74: transparent conversion; anyway, this new version of our
! 75: SpellChecker plugin is, as tests show so far, fully
! 76: Unicode-safe... well, probably the <em>only</em> freeware
! 77: Web-based spell-checker which happens to have Unicode support.
1.1 www 78: </p>
79:
80: <p>
81: The Perl Unicode manual (man perluniintro) states:
82: </p>
83:
84: <blockquote>
85: <em>
86: Starting from Perl 5.6.0, Perl has had the capacity to handle Unicode
87: natively. Perl 5.8.0, however, is the first recommended release for
88: serious Unicode work. The maintenance release 5.6.1 fixed many of the
89: problems of the initial Unicode implementation, but for example regular
90: expressions still do not work with Unicode in 5.6.1.
91: </em>
92: </blockquote>
93:
94: <p>In other words, do <em>not</em> assume that this script is
95: Unicode-safe on Perl interpreters older than 5.8.0.</p>
96:
97: <p>The following Perl modules are required:</p>
98:
99: <ul>
100: <li><a href="http://search.cpan.org/search?query=Text%3A%3AAspell&mode=all" target="_blank">Text::Aspell</a></li>
1.2 ! www 101: <li><a href="http://search.cpan.org/search?query=XML%3A%3ADOM&mode=all" target="_blank">XML::DOM</a></li>
1.1 www 102: <li><a href="http://search.cpan.org/search?query=CGI&mode=all" target="_blank">CGI</a></li>
103: </ul>
104:
105: <p>Of these, only Text::Aspell might need to be installed manually. The
106: others are likely to be available by default in most Perl distributions.</p>
107:
108: <hr />
1.2 ! www 109: <address><a href="http://dynarch.com/mishoo/">Mihai Bazon</a></address>
1.1 www 110: <!-- Created: Thu Jul 17 13:22:27 EEST 2003 -->
1.2 ! www 111: <!-- hhmts start --> Last modified: Fri Jan 30 19:14:11 EET 2004 <!-- hhmts end -->
1.1 www 112: <!-- doc-lang: English -->
113: </body>
114: </html>
FreeBSD-CVSweb <freebsd-cvsweb@FreeBSD.org>