Archive for May, 2009

Wolfram Alpha came out today.  For anyone who hasn’t heard of it, Wolfram Alpha is a “computational knowledge engine” created by Stephen Wolfram which draws information from various Internet sources.  Anyway, I thought I’d play around with what it does with langauge data.

Searching languages is fairly straightforward.  Search English or Spanish for example, and you get a form giving basic info — number of speakers character frequency, lexical simiarity (what languages share the most cognates with it), genetic classification and major regions where it’s spoken.  If I search “Chinese“, it defaults to Mandarin Chinese but also gives me an option to go to Chinese langauges, which gives me links to several Chinese/Sinitic languages as well as a link to Pidgin English Chinese, which leads to a comparison of English, Chinese, and Hawaii Creole English  — which I had thought referred to a creole of English and Hawai’ian — the Wikipedia page doesn’t mention Mandarin, though it says Cantonese was a major source language — along with Portuguese, Japanese, and several others.  That page also gives you the option to go to Tok Pisin which also doesn’t seem to have such a strong Madarin influence according to Wikipedia.

A couple other things I find: Each language entry lists the numbers 1-10 (taken from Zompist’s list) but no mention is made of what numerical base the language uses.  While pretty much all major languages use a decimal base, there are a number of languages, particularly in Mesoamerica, that use a visegimal base (base-20) and there are other bases in use.  (I should note that “vigesimal” leads to a dictionary definition, and a serch for “base 20” seems to take you to something about DNA base-pairs.)

Another issue is with writing systems.  Several languages correctly identify the writing system (Latin alphabet for English and Spanish) others are not so helpful.  The entry for Japanese lists the “Chinese script”, apparently ignoring the kana.  And searching for Latin alphabet and various other scripts turns up nothing useful, and even more seems to associate a script only with a particular language.  I found that Devanagari has some useful information listed, but Cyrillic only has a dictionary definition.  “Chinese characters”, “hanzi”, “kanji”, and “hanja” all turned up nothing, though you can find something with “Chinese script” — though it’s really only a Mandarin-specific block of Unicode code points.

Basically Wolfram Alpha looks like a very nice little tool for research, but it has some kinks.  This is really just a small segment of what the system can do, and being that it is built on a mathematics engine I’m sure that it does a lot better with things like equations and statistics.  For example: Each of my language searches turned up good info on number of speakers, character frequency, and even an estimate of translation length (based on character count).  The semantic search could definitely use some improvement, and I’m sure it will in the future.  Ultimately I think this system will find a niche or maybe several.  But it’s definitely not going to kill Google or Wikipedia.  In fact, I can really only see it being a compliment to all the info we already have on the web.


Read Full Post »

So, since I’m not in Mexico at the moment.  (I could go later in July, but I’m not going to do that, too close to the China trip), I decided I’d post about some things I’m doing to keep myself busy during the summer.

One project I’ve been working on is a couple of nativity scenes I promised to make for my mom and my sister. They’re plywood cutouts, similar to a multiple-piece nativity set I built for my church for my Eagle Scout project a few years ago.  These scenes I’m working on now are single-piece jobs cut from the same pattern, a lot easier job.  I’m maintaining a Flickr set documenting my progress.  Pics below… (more…)

Read Full Post »

Alright, I promise I won’t turn this into “swine flu blog” but I thought I might do a little commentary on the news as it stands worldwide.  According to the latest WHO update, swine flu has caused about 25 deaths in Mexico (out of 590 confirmed cases).  Worldwide there are a total of 985 cases that have resulted in a grand total of … 26 deaths (one was in the US).  That’s about a 2.6% death rate — before counting the extenuating circumstances that probably lead to a higher death rate in Mexico (poverty leading to poor health care and late detection).

So the new H1N1 is no where near as bad as the hype surrounding it.  So why do we have Egypt killing all pigs in the countryChina detaining Mexican citizens even if they haven’t been to Mexico in the past months, and of course, the latest in useless, unhelpful measures — this: (more…)

Read Full Post »