Aim of the website is it to display the similarity of words of a dataset based on the Levenshtein Distance. Levenshein Distance (aka Edit Distance): [url removed, login to view]
[url removed, login to view]~kleiweg/lev/
[url removed, login to view]
The dataset is currently an Excel sheet containing one column with Swedish words and one column with the corresponding German translations:
After clicking the "start" button, a single word (either an Swedish or German word) from the dataset is displayed on the website wherein each character of the word is shown in a colored "square".
This word is then compared to its corresponding translation in the other language and the Levenshtein Distance between these two words is calculated (Levenshtein distance is obtained by finding the cheapest way to transform one string into another. Transformations are the one-step operations of (single-phone) insertion, deletion and substitution).
This cheapest way of transformation is then visualised on the website:
II. THE STEPS INVOLVED:
Firstly, the user should be able to chose the source language (the initially displayed language) and the target language (the language to which the initially word is translated to).
E.g. The user chooses Swedish as source language. Then the user clicks the start button and the word "NEJ" appears (meaning no in English). Its German translation is "NEIN".
Then the user clicks the "translate" button and the the Levenshtein distance is calculated; in this case:
(1) delete "J" of "NEJ" to obtain "NE"; then
(2) add "I" to the end of "NE" to obtain "NEI"; then
(3) add "N" to the end of "NEI" to obatin "NEIN".
This "Levenshtein Path" should be visualised; ie. the user clicks the "translate" button and the animation starts:
In step (1) the box containing "J" disappears.
In step (2) the box containing "I" appears.
In step (3) the box containing "N" appears.
The steps should be sequentially.
After the transformation is done also the word in the source language is displayed again.
A "back" button should allow the user to see the transformation backwards from the target language to the source language.
A "next word" button will allow the user to move on to the next word in the data set to start the process for another word.
III. FURTHER SPECS
The webtool to be developed should run on all standard webbrowsers.
The freelancer will receive an example dataset for developing the webtool. The freelancer should decide on the appropriate scripting and database so that the tool runs smoothly. However, the tool should not use Flashplayer for visualisation (it should also run on android tablets and ipads).
It would be nice (but not necessary) if I could run the tool locally on my Windows 7 PC under Apache for testing.
In case you should have any further questions, please do not hesitate to contact me.
Please find also sample dataset data attached as xls file!