Announcing what's coming to your car stereo (and ipod, and car navigation system) in 2007....

ajayjuneja · Jan 23, 2005

So, some of y'all may remember me showing off a little video of song selection in Winamp by way of a dialogue system back from 2 years ago or so...

Well I'd thought I'd give ATOT people a sneak peak of what's to come (I started a company off of this work). The current version of the application is 10 times faster than the version you all saw two years ago, we now can do backtracking and best of all, it uses up 1/25th the amount of RAM as the old version and runs 24/7 where as the old version used to crash in an hour.

The new demo, talking to your stereo

Oh, and as for these
Gracenote people that seemed to announce that their trying to do the same thing... well one of my buddies knows one of the high up people at gracenote... and we'll be talking soon

Oh, and this will be working for your ipod and car navigation system soon too.

----------------------
Features you'll see in the video if you look closely:

1. resolving confusion. There are a couple times I ask for a song name by the wrong artist, and so the system prompts me for that song I asked PLUS all the songs by the artist I asked. There is another example of prompting me when I have two songs with the same title but by different artists (Yes, I know Roger Waters is ex-Pink Floyd, but that is a live version by him).

2. Dealing with lots of noise... there are some parts that are really noisy, like when I ask for the beatles song, I do have to repeat myself once, but the system doesn't get a single utterance wrong! This is on a database of over 1000 songs. I too can't stand that text to speech voice for too long, thankfully we can tell it to shut up. There WILL be better Text to speech voices in the commercial product.

3. The system can tutor you on how to use it when it launches. A "dialogue" can also be used on launch to set up user preferences.

------------------
Other features we have now, but not shown in this video:

1. Nesting of queries. If I said "Play foxtrot" and then after the responses come with a lot I can say "Frank Sinatra" and it will narrow the query to "foxtrots by frank sinatra."

2. Backtracking. You could say "scratch that" or "I didn't mean that..." or orther phrases of that type to undo an action. Backtracking isn't included in music selection due to the simple nature of the task (as compared to car navigation).
-----------------

How's it work? Lots of really complex semantic parsing to determine your sentence structure and it keeps track of what you said before, too. We are the parser, not the speech recognizer.

Cliff notes of above

The system really rocks because it uses semantic parsing and keeps track of the state of the conversation.
Go download the video and let me know if you want to be a beta tester in the near future.

imported_Mike · Jan 23, 2005

:thumbsup: That is badass

ajayjuneja · Jan 23, 2005

Originally posted by: CheapArse
:thumbsup: That is badass

Thank you

I quit my day job at BeVocal so I could commercialize this.

Actually I wasn't at BeVocal very long, because I TOLD them I would continue working on this project 4 months before I started working there, they said they were fine with that, then 6 weeks AFTER I started they decided they weren't fine with that, so I said "buh-bye."

It was a sign from god letting me know that I should spend all my time on doing what I really wanted to spend all of my time doing. And poof, all cylinders are a firing now.

System specs to run it:

Pentium-Pro 200 Mhz, 64 MB RAM
Windows NT/2000/XP/2003, or Linux.
I am debating a mac mini port (would y'all buy it for the mac mini?)
And I am working on an Intel-XSCALE port for Windows CE & Linux.

imported_Mike · Jan 23, 2005

Originally posted by: ajayjuneja

Originally posted by: CheapArse
:thumbsup: That is badass

Click to expand...

Thank you I quit my day job at BeVocal so I could commercialize this.

Actually I wasn't at BeVocal very long, because I TOLD them I would continue working on this project 4 months before I started working there, they said they were fine with that, then 6 weeks AFTER I started they decided they weren't fine with that, so I said "buh-bye."

It was a sign from god letting me know that I should spend all my time on doing what I really wanted to spend all of my time doing. And poof, all cylinders are a firing now.

System specs to run it:

Pentium-Pro 200 Mhz, 64 MB RAM
Windows NT/2000/XP/2003, or Linux.
I am debating a mac mini port (would y'all buy it for the mac mini?)
And I am working on an Intel-XSCALE port for Windows CE & Linux.

So how will we be able to run something like this in our cars?

ajayjuneja · Jan 23, 2005

Originally posted by: CheapArse

So how will we be able to run something like this in our cars?

It'll work it's way into units like the Pioneer AVIC-N1/N2... that's what the xscale chip port is for.

Think of having a car stereo unit in a single din that has:
1. LCD touchscreen
2. Linux
3. WiFi
4. Hard drive or a bucketload of flash memory
5. USB port or Firewire port
6. Real time traffic data.
7. Dialogue system.

------------

I personally, am sticking a computer into my car with an opus 12 volt power supply.

imported_Mike · Jan 23, 2005

Originally posted by: ajayjuneja

Originally posted by: CheapArse

So how will we be able to run something like this in our cars?

Click to expand...

It'll work it's way into units like the Pioneer AVIC-N1/N2... that's what the xscale chip port is for.

Think of having a car stereo unit in a single din that has:
1. LCD touchscreen
2. Linux
3. WiFi
4. Hard drive or a bucketload of flash memory
5. USB port or Firewire port
6. Real time traffic data.
7. Dialogue system.

------------

I personally, am sticking a computer into my car with an opus 12 volt power supply.

So you've developed the software?(that is what the video is demonstrating right?) Are you in talks with pioneer/alpine etc. already or is that where you'd like your company to go in the near future?

EDIT: I know the video is demonstrating how the program works

Megatomic · Jan 23, 2005

I can't watch the video while at work, I'll have to check it out in a few hours when I get home.

*has high expectations*

ajayjuneja · Jan 23, 2005

So you've developed the software?(that is what the video is demonstrating right?) Are you in talks with pioneer/alpine etc. already or is that where you'd like your company to go in the near future?

EDIT: I know the video is demonstrating how the program works

Yes, it's the SOFTWARE we developed, then we'll help with some of the hardware integration aspects. We are in touch with TeleAtlas (map supplier to Honda/Acura, VW/Audi, Nissan/Infiniti, Porsche, Mercedes Benz, Pioneer, Alpine, Siemans/VDO, and Blaupunkt/Bosch).

Also in touch directly with Apple, Gibson Audio (the Wurlitzer jukebox), the creator of the Acura RL Navigation System, as well as the XM Satellite Radio people who did the real time traffic data for the Acura RL & Pioneer Avic-N2. Bosch was one of our funders when this was a research project, so we have had long time ties with them (I like them).

imported_Mike · Jan 23, 2005

Originally posted by: ajayjuneja

So you've developed the software?(that is what the video is demonstrating right?) Are you in talks with pioneer/alpine etc. already or is that where you'd like your company to go in the near future?

EDIT: I know the video is demonstrating how the program works

Click to expand...

Yes, it's the SOFTWARE we developed, then we'll help with some of the hardware integration aspects. We are in touch with TeleAtlas (map supplier to Honda/Acura, VW/Audi, Nissan/Infiniti, Porsche, Mercedes Benz, Pioneer, Alpine, Siemans/VDO, and Blaupunkt/Bosch).

Also in touch directly with Apple, Gibson Audio (the Wurlitzer jukebox), the creator of the Acura RL Navigation System, as well as the XM Satellite Radio people who did the real time traffic data for the Acura RL & Pioneer Avic-N2. Bosch was one of our funders when this was a research project, so we have had long time ties with them (I like them).

Nice, these sound promising:

1. LCD touchscreen
2. Linux
3. WiFi
4. Hard drive or a bucketload of flash memory
5. USB port or Firewire port
6. Real time traffic data.
7. Dialogue system.

Sign me up to be a beta tester!

DaWhim · Jan 23, 2005

I will give your a :thumbsdown:, if that is only compatible with ipod.

imported_Mike · Jan 23, 2005

Originally posted by: DaWhim
I will give your a :thumbsdown:, if that is only compatible with ipod.

Did you even watch the video?

DaWhim · Jan 23, 2005

Originally posted by: CheapArse

Originally posted by: DaWhim
I will give your a :thumbsdown:, if that is only compatible with ipod.

Click to expand...

Did you even watch the video?

nope...

chuckywang · Jan 23, 2005

marked for later

z0mb13 · Jan 23, 2005

hmm so basically you developed a new compression?

Wanescotting · Jan 23, 2005

awesome concept

nsafreak · Jan 23, 2005

Holy COW that's impressive. I must say that the semantic parsing seems to work pretty darn well. I've used speech recognition software before and usually there was a training period for the software to get used to the nuances of a way a person spoke. Does this software have to be trained as well or is that not necessary? I also like how it will give you the various songs available for each artist. Can it do it by album as well? Very very nice work I must say. I would be interested in beta testing the software when you get to that stage. I do have previous experience beta testing software as well if that helps at all.

Cristatus · Jan 23, 2005

big :thumbsup: on the software, but one question, or rather comment/quirk: are you going to be using the LH voice modules on your piece of software? i wouldn't mind if you do, but it's just that it's hard to understand it sometimes...also, rather annoyingly roboticl, but i guess there's nothing you can do about that, is there?

h8red · Jan 23, 2005

Very very nice job. I don't really care for the voice modulation either - but still excellent job

kyparrish · Jan 23, 2005

sweet!

Rogue · Jan 23, 2005

After seeing the video, I have a few suggestions from a user standpoint if you're willing to hear them:

1) Sometimes the voice response would speak over the start of a song. perhaps delay the song based on the length of voice response

2) perhaps a "shhhh" command to immediately quiet the voice response

That is all...

RaynorWolfcastle · Jan 23, 2005

looks nice, but I would set it up so that when it recognizes that you're giving a command, it automatically puts the volume down to 10% or something. As a user, I have no interest in yelling over the music. Also, I imagine you've made plans for how it will deal with people having a conversation in the car? Probably an activation button on the steering wheel or something?

Freejack2 · Jan 23, 2005

Very very nice. I'd also like to be a beta tester when you get to that stage. Something like this would be great for home and especially car use so I don't have to look down to search through the tracks.

Turkish · Jan 23, 2005

Originally posted by: DaWhim
I will give your a :thumbsdown:, if that is only compatible with ipod.

ignorance and stupidity of unseen monolithic proportions. die under a 700 lbs. McDonalds loving women while she is playing with her nipples.

daniel1113 · Jan 23, 2005

I am very imrpessed, but I have a few questions:

How well does this work in a driving scenario? For example, combine the music with wind noise, road noise, etc.

Also, how does the system handle odd spellings? For example, you asked the computer to play a song by "Caine." What if the artist's name was "Kaine?" Would the system still find the song?

quakefiend420 · Jan 23, 2005

looks very cool...gotta do something about that annoying voice though

Announcing what's coming to your car stereo (and ipod, and car navigation system) in 2007....

Golden Member

Lifer

Golden Member

Lifer

Golden Member

Lifer

Lifer

Golden Member

Lifer

Lifer

Lifer

Lifer

Lifer

Lifer

Diamond Member

Diamond Member

Diamond Member

Senior member

Diamond Member

Banned

Diamond Member

Diamond Member

Lifer

Diamond Member

Lifer