For GW project

23rd December 2017

Second draft of JSON export from GW

Download: gydic2.json approx 1 MB

  1. I added an "audio" attribute whose value should be the filename for the lexical item. Note that there is not a perfect correspondence between John's set (here: https://www.dropbox.com/sh/u7rtl0mlyiv80fn/AAAOTPrad7Kgu7cX3cN4wi2Ka/WordsS%2BM?dl=0) and the names mapped by Gayarragi, Winangali and previously adapted for Ma! I imagine that some changes were made to the lexical stock for Ma! However, I have made as close a likely correspondence set as I can. Someone should check the matches between written and audio forms later. I can supply a spreadsheet of the correspondences that have been used to generate the JSON.
  2. I took away the long-form language name attribute "lgs2" as it's probably not needed.
  3. I omitted to mention in previous notes (22nd Dec 2017) that I have also supplied attributes for sense numbers ("snum") where there is more than one sense for an entry.

22nd December 2017

First draft of JSON export from GW

Download: gydic.json approx 1 MB

Following discussion with Ben and my work with the existing dictionary data and lexicographic principles, the following suggested amendments/additions have been made to the JSON schema:

  1. Subentries. The ovided schema had no provision for subentries, whether embedded in their parent entries or otherwise distinguished. I added an attribute "issub" at the headword level which indicates whether an entry is a subentry or not (these are in serial order, ie a subentry will logically (but not syntactically) be a child of a preceding head entry).
  2. Part of speech. The supplied schema has a "ps" attribute (which I presume is part of speech) - but at the sense level, which is an unusual lexicographic practice. I have placed a "ps" attribute at the headword level.
  3. Languages. As discussed, John wants the relevant language(s) indicated. Language information can apply at the headword or the sense level, so I have used our data to supply a "lgs" attribute at the headword level and a "slgs" attribute at the sense level. Also I added the other data we have at the headword level with full names of the languages "lgs2" - I can delete this by regenerating with different rules or using regex if it's a problem.
  4. Status (eg 'new'). John wants the capability to indicate if words are new (and perhaps other related info, such as links to discussion), so I added an attribute at the headword level "newstatus" and also at the sense level "snewstatus" (because existing words can be given new senses).
  5. Sounds - as I think we discussed in Cairns, in Gayarragi, Winangali the pronounced words are represented as a group of word sounds on a fixed time grid within single audio files, where individual words are played at appropriate calculated offsets. I think John said that he has individual sound files but I don't have them, so I am not sure of their filenames. John may have said that the sound files were simply named as the "id" attribute value plus the appropriate extension for the audio format. So this might need a little further investigation or organising. If you can't generate the audio filenames/links, then John could let me know the exact forms of the audio filenames and I can add them to the JSON.
  6. As discussed, the senses have only "def", not "ge".
  7. There are a small number (about 30) of cross references (coded as a kind of HTML link) which I have left in place - these could be deleted via simple regex if they are a problem (or let me know, I can do it).
  8. One thing I couldn't understand is why senses (and sentence examples) are represented as objects if there is one instance only, but as arrays if there are multiple instances. This made generation a little more complex to program and I am not convinced that it is good "data modeling" (they should be consistent, e.g. arrays of one or more objects). Nevertheless, I have supplied the data with those structures.
  9. I checked the JSON file for validity and it was valid according to two different validators, so I am confident that it is well formed.
  10. If you need any changes to the JSON, just let me know, as I can easily make modifications to the generating program now that the main logic is written.

30 June 2014

Third Mac version of Gayarragi, winangali!

Download: GW.dmg approx 300 MB

Note: this is the application only - if you want the links to resources and help to work, place the new app (once taken out of the dowloaded volume etc) in the previous folder together with 'resources' and 'gwhelp'

A revised version of GW.app with these fixes:

Other minor notes (to add to previous changelog):

17 June 2014

Second Mac version of Gayarragi, winangali!

More details soon to come about many changes, corrections, and fixes in this version.

This version: draft of 17 June 2014
Download:GW-mac2.dmg approx 300 MB
Notes:

8 Dec 2013

1. Tab delimited file for subentries
John requested this. This file has all subentries, extracted and formatted as if they are normal headword entries.

Download: DictionaryTarget-Subentries.txt 213KB - tab-delimited file


2. Correction for entry balan
John noted that the entry for balan does not display properly. This was due to an error in the original data (also in the original CD, now corrected). In the Data Upload Template, sheet Dictionary Target, please change Detailed Entry (HTML) cell for 13002 balan as follows:

CHANGE:
<div class='lemma'><span class='form-main'>balan</span> <span class='pos'>noun</span> </div><div class='defblock'><span class='gloss'>z</span><div class='sensenotes'>ero</div></div>

TO:
<div class='lemma'><span class='form-main'>balan</span> <span class='pos'>noun</span> </div><div class='defblock'><span class='gloss'>zero</span></div>

Details also are also here as a plain text file: Download: balan-correction.txt 1KB - plain text file


26 Oct 2013

Tab delimited file for full languages data

Download: Languages2.txt 119KB - tab-delimited file (columns: Dictionary target ID -- Target Word -- Languages (global for word) -- Source word (English: gloss string + languages))

25 Oct 2013

Tab delimited file for additional languages data

Download: Languages.txt 106KB - tab-delimited file (columns: Dictionary target ID -- Target Word -- Languages -- Source word (English: gloss string))

24 Oct 2013

Tab delimited file for Data Upload Template, worksheet Dictionary Source

Download: DictionarySource.txt 101KB - tab-delimited file (columns: Dictionary Source ID -- Dictionary target ID -- Source word (English) -- Part of Speech -- Target Word)


Tab delimited file for Data Upload Template, worksheet Dictionary Target

Download: DictionaryTarget.txt 1.05MB - tab-delimited file (columns: Dictionary Target ID -- Target Word -- Languages -- Audio URL)

23 Oct 2013

File of HTML strings correponding to display entries for each headword. Indexed by IDs. In this version, subentries are nested under their correponding main entries.

Download: GW-entries-HTML.txt 1MB - tab-delimited file (columns: Original_ID -- new_ID -- display entry HTML)


Simple CSS file for display of HTML display entries - can be adapted for other purposes.

Download: gydex.css 1KB css/text file

22 Oct 2013

Gloss-based list with IDs. I'm still not clear how glosses are to be handled but here is a first export for comment. Note: the suffix forms are preceded by an en-rule, not hyphen (just to make it easier to work with this data in Excel)

Download: GW-glosses.txt 104KB - tab-delimited file

21 Oct 2013

First simple export of GW data for Ma!

It's not clear to me what data we'll settle on, so I thought to start by an initial simple export and get response/request from there for planning next steps.

Download: lexData.txt 124KB - tab-delimited text file (columns: Original_ID -- new_ID -- part of speech -- word form (with conjugation marker) -- gloss(es))


Here is a persistent file with the mapping of all lexical IDs from the current GW app to new IDs that should be suitable for Ma!

Download: maIDmapping.xls 1200KB - MS Excel file


HTML display versions of all entries have been generated (and can be supplied in modified form) - see this intermediate web version


29 April 2013

First Mac version of Gayarragi, winangali!

There is still much work to do, but the major hurdles overcome.

This version: draft of 28 April 2013
Download: GW-Mac-001.dmg approx 600 MB
This version has:

30 July 2012

Final GW user survey results. The survey is now closed. We got a total of 52 responses. Please view the documents below before our discussion about how to proceed.

all_summary.pdf Summary of all responses (except graphs don't display)
all_graphs.pdf All the graphs
all_details.pdf Full details of responses including comments/feedback

teacher_summary.pdf Summary of teacher responses
all_teachercomments.pdf Relevant details of teacher responses

learner_summary.pdf Summary of learner responses
all_learner-comments.pdf Relevant details of learner responses

parent_summary.pdf Summary of parent responses

GW-survey-questions.pdf For records only - original blank survey
GW-survey-notes.txt Summary notes

12 July 2012

GW-survey-results-3.doc Early GW user survey results

30 May 2012

Draft survey, to get evaluations and feedback in preparation for Mac and 2nd edition

11 MARCH 09

Update files only, for new stories
Download: newfiles.zip approx 30 MB

Notes: Please just write these files over the existing versions, and leave other files as they are. Note that they are not all in the same folder, eg main.cxt is in the top level, and the stories folder goes inside the mvcst folder (just look for matches on filenames). Let me know if you have any problems.

3 MARCH 09

Latest version
Download: movie.zip approx 162 MB

13 NOVEMBER 2008

Latest version
Download: GY20081113.zip approx 160 MB

Please see the Googledoc for details

25 AUGUST 08

Latest version
Download: gy.zip approx 160 MB
This version has:

1 July 08

Latest version
Download: gy.zip approx 100 MB
This version has:

16 May 2007

Mac version, almost same as 27 April (with minor changes only)
Download: GY2.DMG approx 200 MB

27 April 2007

Updated version - to keep download smaller, please re-use previous versions of the following:
startGY.exe, xtras folder, songs folder, word_sounds.cxt, sentence_sounds.cxt
Download: PROD14_Win.zip approx 8 MB
This version has:

10 March 2007

Updated version - please re-use previous versions of songs, stories and xtras folders, and startGY.exe.
Download: PROD13_Win.zip approx 60 MB
This version has:

28 December 2006

Updated version - please re-use previous versions of songs, stories and xtras folders, and word_sounds.cxt, and startGY.exe.
Download: PROD12.zip approx 10 MB
This version has:

17 November 2006

Updated version - re-use previous versions of songs, stories and xtras folders, and word_sounds.cxt, and startGY.exe.
Download: PROD11.zip approx 2.1 MB
This version has:

17 Oct 2006

Updated version - re-use previous versions of songs and stories folders, and word_sounds.cxt, and startGY.exe. This download has a new xtras folder.
Download: PROD10.zip approx 4 MB
This version has:

10 Oct 2006

Updated version - re-use previous versions of songs and stories folders, and word_sounds.cxt, and startGY.exe.
Download: PROD9.zip approx 2 MB
This version has:

28 September 2006

Updated version - re-use previous versions of songs and stories folders, and word_sounds.cxt, and startGY.exe. Hopefully it will all work!
Download: PROD8.zip approx 2 MB
This version has:

19 August 2006

Updated version - re-use previous versions of songs and stories folders, and word_sounds.cxt, and startGY.exe. Hopefully it will all work!
Download: PROD7.zip approx 2 MB
This version has:

5 June 2006

Updated version - all files except for folders "txts" and StartGY.exe (re-use previous versions of these and copy into correct locations).
Download: PROD6.zip approx 65 MB
This version has:

5 May 2006

Updated version - all files except for folders "txts" and "xtras".
Download: GY_5May06.zip approx 3.8 MB
This version has: To install this update, unzip all the files, then place whole of previously-sent directories "txts" and "xtras" in their relevant locations (ie same as in previous versions). let me know if any problems.

25 April 2006

Updated version - replacement files only.
Download: PROD3.zip approx 1.7 MB
This version has: Notes:

9 April 2006

Updated version of GY CD with dictionary search implemented.
Download: PROD3.zip approx 40 MB
See previous version info for unpack/install instructions. This version has: Related but not yet implemented:

Previous versions:

24 Jan 2006:

Updated version of GY CD with song player.
Download: GY_songs_draft2.zip 40 MB
New song player engine. New song menu etc. Other improvements. All requested changes made.
Morphemes link to dictionary index - temporary popup shows basic data.
Some errors remain - missing data, incorrect trs files etc.
Comments cannot be synchronised with pages as there is no supporting data.
Please inform me re any other errors etc.

Previous version:

Download: GY_songs_draft1.zip 41 MB
Very first draft of GY CD with song player

Notes/please check the following:

Email me with any other questions etc

David