All posts by snowdeal

well, whadya know. AOLiza passed the turing test. or at least aol users ruined the curve for everyone:

“”I noticed AIM [AOL Instant Messenger] had a robust script library,” says Fox. “If you could hook it into a program, which would it be?” The answer he came up with was Eliza, an AI Perl program that responds to text messages with psychoanalytic-styled questions. He dubbed his new hybrid “AOLiza,” and set “her” loose on August 15.

Once AOLiza was up and running, it wasn’t long before she was fooling unsuspecting AOLers who messaged him by mistake. Now, AOL may not exactly be the digerati hangout that, say, The Well is, but its members aren’t stupid either — or at least all of them aren’t. AOLiza was working, and working pretty well. Users began carrying on elaborate conversations with AOLiza.

One person — who Fox dubbed “five” — even entered into a drawn out conversation about his ex-girlfriend with Eliza”

oooohhhhh, the possibilities are endless:

“Many of Britain’s most eminent medical scientists believe the birth of a cloned baby is inevitable despite
society’s current aversion to the idea.

More than half of a panel of 32 scientists surveyed by The Independent said “reproductive cloning” would be attempted within 20 years if the technical and safety issues could be overcome.”

i see a whole army of me in the future. think of all the blogging i could get done.

ha! this bit on navigable maps of the web won’t be found on slashdot:

“Mapping the Web is a huge field that falls into two main pieces: maps that show us something interesting about the Web, and maps that help us navigate the Web. These two need not be essentially connected.”

“There will be many solutions to this problem, and which ones we like will have everything to do with our personal way of thinking and the type of problem we’re trying to solve at the moment. But navigable maps of Web sites clustered by relevancy to our interests are of unique important to the Web because the Web space is itself organized not by uniform units of distance but by *interest* itself. Distance, on the Web, is measured by irrelevance. Navigable maps capture this essential fact of our new world, and thus not only map Web distance but conquer it.”

and just in case you thought i was beating someone to the punch. don’t. because, i appropriated it from peterme.

note to self. check out alphanumerica’s themebuilder.

“The Theme Builder is a graphical tool made to simplify the creation of themes for Mozilla. This beta version of the tool gives users the ability to apply new graphic designs to the browser without changing its functionality. Individuals can use the tool to create their own personal themes while companies can “brand” the browser with their company colors and logo.”

oh for cyring out loud, yes, i know it was on slashdot. what can i say, it’s all slashdot all the time today.

the slashdot crowd is discussing [sic] tuneprint. one of the creators speaks up to improve the signal-to-noise ratio:

“The general idea is pretty simple. We take the input audio. We condition it (adjust it to a known sampling rate and volume.) We pass it through the psychoacoustic model (it’s about a notch more complicated than what you’d see in a mp3 encoder, which ain’t saying much. This is all stuff that was mostly hashed out decades ago.) This model effectively strips the parts of the sound you can’t hear — the desired result being that even if the audio has been compressed or manipulated subaudibly, the
result is still the same. Okay, so the net result of all of this is a vector that covers a very small segment (fraction of a second) of audio. We stack several of these vectors (possibly separated in time by a bit) side-by-side to get a big vector. Then we do completely boring and standard and well-understood statistical and pattern-matching stuff on the vector to make it smaller and more palatable for the server — think of it as lossy compression. Then it goes off to the server. The server is about equal in
complexity to a text search engine. (I say this fully realizing that I have only a vague impression how Google works. It’s certainly a lot more complicated than the obvious hash-table-of-sorted-lists stuff.) It finds the database vector that’s the best match in a fairly boring but efficient way. (No, it does not involve searching through all tracks one by one, no more than Altavista searches through all web pages one by one every time you want to find some porn.) Call the result a submatch. Back at the client, the whole process is repeated a bunch more times, generating a stream of submatches (“Radiohead offset 0.. Radiohead offset 1024 or 16384.. Slashdot’s Gr34test Hits 5262324.. Radiohead offset 3072..”) from the input audio stream. Then, the client looks at the submatches and tries to figure out what the input audio was and where the song boundaries are (did somebody really stick in a sample from Slashdot’s Gr34test Hits, or was that just an unlucky match?)

See? Not magic. It’s a challenging problem, but not an impossible problem. The reason that this doesn’t exist right now is not that generations of scientists have tried and failed, but rather that people didn’t care too much until lately and nobody’s gotten off their ass and done anything about it yet. I like big but approachable problems, which is one of the reasons I’m excited about this.

FOR ALL OF YOU WHO FELL ASLEEP THROUGH THAT: YOU CANNOT ADD AN INAUDIBLE TONE TO THE MUSIC AND BREAK TUNEPRINT. THE FINGERPRINT IS BASED ON THE LARGE-SCALE PSYCHOACOUSTIC FEATURES OF THE MUSIC. IF MP3 ENCODERS CAN DO IT, SO CAN WE. Maybe not perfectly, but enough to have a fighting chance. THAT’S THE WHOLE POINT HERE. ”

so take that, naysayers.