• We should now be fully online following an overnight outage. Apologies for any inconvenience, we do not expect there to be any further issues.

project: convert 1,081 pages of hand written text to word

Page 2 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

JoLLyRoGer

Diamond Member
Aug 24, 2000
4,153
4
81
Yeah, post up some of the stuff here with an e-mail link for returning a .doc file. I'd do a couple of pages for you just for the h3ll of it.

I'm sure there are enough bored people around here that would help contribute. In fact, you could call it the ATOT Project and maybe even get a sticky from the mods,

Set up a webpage linking to the scanned pages and remove each page as you get the pages back in .doc format.

Just a thought...
 

theNEOone

Diamond Member
Apr 22, 2001
5,745
4
81
Originally posted by: JoLLyRoGer
Yeah, post up some of the stuff here with an e-mail link for returning a .doc file. I'd do a couple of pages for you just for the h3ll of it.

I'm sure there are enough bored people around here that would help contribute. In fact, you could call it the ATOT Project and maybe even get a sticky from the mods,

Set up a webpage linking to the scanned pages and remove each page as you get the pages back in .doc format.

Just a thought...
that's an idea i'd be willing to explore. because it's over 1,000 pages i would probably put up the scans in 10-page bunches. my main concern is having two people work on the same batch at the same time...of course i would only know until i received both completed submissions.

is there any way to allow uses to post their name next to a particular 10-page scan bundle so that others know that it's been downloaded and being worked on....maybe there's also some way of allowing the same user to post when it was downloaded and/or to update progress, in case s/he decides to drop the project. i can design the website just fine, the problem is this dynamic type of updating that i'm completely unfamiliar with.


=|
 

AbsolutDealage

Platinum Member
Dec 20, 2002
2,675
0
0
Originally posted by: theNEOone
Originally posted by: sygyzy
Stop spewing the OCR crap here. OCR software, at least stuff that the OP can afford, does not convert handwritten text. It's unbelievable the amount of misinformation that you guys like to spread. You are not being helpful. Hire someone to type it or type it yourself.
i too was under the impression that OCR software was soley for typed text on paper that you can scan and convert to editable text.

however, tablet PCs allow handwritten input and OneNote (which is pretty cheap) can convert your handwriting to text very reliably...even cursive writing. i used to own a tablet and was suprised at how accurate the conversion was. you can even have two or three different types of handwriting in a single sentence and it works great.

I agree. I had a chance to play with a tablet PC some time ago... and the handwriting recognition on there was freakishly good.
=|

 

sygyzy

Lifer
Oct 21, 2000
14,001
4
76
Originally posted by: AbsolutDealage
Originally posted by: theNEOone
Originally posted by: sygyzy
Stop spewing the OCR crap here. OCR software, at least stuff that the OP can afford, does not convert handwritten text. It's unbelievable the amount of misinformation that you guys like to spread. You are not being helpful. Hire someone to type it or type it yourself.
i too was under the impression that OCR software was soley for typed text on paper that you can scan and convert to editable text.

however, tablet PCs allow handwritten input and OneNote (which is pretty cheap) can convert your handwriting to text very reliably...even cursive writing. i used to own a tablet and was suprised at how accurate the conversion was. you can even have two or three different types of handwriting in a single sentence and it works great.

I agree. I had a chance to play with a tablet PC some time ago... and the handwriting recognition on there was freakishly good.
=|

You are absolutely correct Neo, however it is irrelevant since your data is in the form of handwritten hardcopy. I know what you are thinking. Why not adopt that technology over? Well good luck with that ...
 

zephyrprime

Diamond Member
Feb 18, 2001
7,512
2
81
There are some websites where you can get bids on work like this. Nothing like cheap Indian/Eastern european labor.
 

JoLLyRoGer

Diamond Member
Aug 24, 2000
4,153
4
81
Originally posted by: theNEOone
Originally posted by: JoLLyRoGer
Yeah, post up some of the stuff here with an e-mail link for returning a .doc file. I'd do a couple of pages for you just for the h3ll of it.

I'm sure there are enough bored people around here that would help contribute. In fact, you could call it the ATOT Project and maybe even get a sticky from the mods,

Set up a webpage linking to the scanned pages and remove each page as you get the pages back in .doc format.

Just a thought...
that's an idea i'd be willing to explore. because it's over 1,000 pages i would probably put up the scans in 10-page bunches. my main concern is having two people work on the same batch at the same time...of course i would only know until i received both completed submissions.

is there any way to allow uses to post their name next to a particular 10-page scan bundle so that others know that it's been downloaded and being worked on....maybe there's also some way of allowing the same user to post when it was downloaded and/or to update progress, in case s/he decides to drop the project. i can design the website just fine, the problem is this dynamic type of updating that i'm completely unfamiliar with.


=|


.php is your friend!

You could write a .php page that let users submit their name in a text box and submit it against a batch of papers and even go so far as to remove the link once someone has signed up for a particular batch. I could possibly help you with some of the code even... I can't imagine it being too complicated.

JR..
 

mugs

Lifer
Apr 29, 2003
48,920
46
91
Originally posted by: theNEOone
Originally posted by: JoLLyRoGer
Yeah, post up some of the stuff here with an e-mail link for returning a .doc file. I'd do a couple of pages for you just for the h3ll of it.

I'm sure there are enough bored people around here that would help contribute. In fact, you could call it the ATOT Project and maybe even get a sticky from the mods,

Set up a webpage linking to the scanned pages and remove each page as you get the pages back in .doc format.

Just a thought...
that's an idea i'd be willing to explore. because it's over 1,000 pages i would probably put up the scans in 10-page bunches. my main concern is having two people work on the same batch at the same time...of course i would only know until i received both completed submissions.

is there any way to allow uses to post their name next to a particular 10-page scan bundle so that others know that it's been downloaded and being worked on....maybe there's also some way of allowing the same user to post when it was downloaded and/or to update progress, in case s/he decides to drop the project. i can design the website just fine, the problem is this dynamic type of updating that i'm completely unfamiliar with.


=|

1. User posts saying he's willing to help.
2. You send user 10 pages worth of scans
3. User sends typed pages back to you
4. You check off that section as done.
5. If user never sends it back to you, you send it to someone else.

Having everything in .txt format would make it easier to combine the work of multiple people. With .doc you'd have to copy and paste everything into one big document, with text files you could just write a quick script to concatenate them all into 1 file.
 

sygyzy

Lifer
Oct 21, 2000
14,001
4
76
Originally posted by: JoLLyRoGer
Originally posted by: theNEOone
Originally posted by: JoLLyRoGer
Yeah, post up some of the stuff here with an e-mail link for returning a .doc file. I'd do a couple of pages for you just for the h3ll of it.

I'm sure there are enough bored people around here that would help contribute. In fact, you could call it the ATOT Project and maybe even get a sticky from the mods,

Set up a webpage linking to the scanned pages and remove each page as you get the pages back in .doc format.

Just a thought...
that's an idea i'd be willing to explore. because it's over 1,000 pages i would probably put up the scans in 10-page bunches. my main concern is having two people work on the same batch at the same time...of course i would only know until i received both completed submissions.

is there any way to allow uses to post their name next to a particular 10-page scan bundle so that others know that it's been downloaded and being worked on....maybe there's also some way of allowing the same user to post when it was downloaded and/or to update progress, in case s/he decides to drop the project. i can design the website just fine, the problem is this dynamic type of updating that i'm completely unfamiliar with.


=|


.php is your friend!

You could write a .php page that let users submit their name in a text box and submit it against a batch of papers and even go so far as to remove the link once someone has signed up for a particular batch. I could possibly help you with some of the code even... I can't imagine it being too complicated.

JR..


Great idea and great offer by JR.

As for scanning, as long as it's on regular sized paper, you can bring it to Kinko's or buy a sheetfeeder. 1000 pages is not that difficult and does not even have to be attended. You can just leave it and go to lunch.
 

RaynorWolfcastle

Diamond Member
Feb 8, 2001
8,968
16
81
1) Find a scanner with an automatic sheet feeder
2) Outsource trancription job to India
3) hire an editor to go through the trancribed version
 

Rip the Jacker

Diamond Member
Dec 29, 2004
5,415
1
76
Hire me. Or Otaking. Or the other fast typers in AT..

edit: Actually, 1081 pages... nevermind, unless its for a good price.
 

rezinn

Platinum Member
Mar 30, 2004
2,418
0
0
Originally posted by: mugs
Originally posted by: theNEOone
Originally posted by: JoLLyRoGer
Yeah, post up some of the stuff here with an e-mail link for returning a .doc file. I'd do a couple of pages for you just for the h3ll of it.

I'm sure there are enough bored people around here that would help contribute. In fact, you could call it the ATOT Project and maybe even get a sticky from the mods,

Set up a webpage linking to the scanned pages and remove each page as you get the pages back in .doc format.

Just a thought...
that's an idea i'd be willing to explore. because it's over 1,000 pages i would probably put up the scans in 10-page bunches. my main concern is having two people work on the same batch at the same time...of course i would only know until i received both completed submissions.

is there any way to allow uses to post their name next to a particular 10-page scan bundle so that others know that it's been downloaded and being worked on....maybe there's also some way of allowing the same user to post when it was downloaded and/or to update progress, in case s/he decides to drop the project. i can design the website just fine, the problem is this dynamic type of updating that i'm completely unfamiliar with.


=|

1. User posts saying he's willing to help.
2. You send user 10 pages worth of scans
3. User sends typed pages back to you
4. You check off that section as done.
5. If user never sends it back to you, you send it to someone else.

Having everything in .txt format would make it easier to combine the work of multiple people. With .doc you'd have to copy and paste everything into one big document, with text files you could just write a quick script to concatenate them all into 1 file.

 

JoLLyRoGer

Diamond Member
Aug 24, 2000
4,153
4
81
Originally posted by: rezinn
Originally posted by: mugs
Originally posted by: theNEOone
Originally posted by: JoLLyRoGer
Yeah, post up some of the stuff here with an e-mail link for returning a .doc file. I'd do a couple of pages for you just for the h3ll of it.

I'm sure there are enough bored people around here that would help contribute. In fact, you could call it the ATOT Project and maybe even get a sticky from the mods,

Set up a webpage linking to the scanned pages and remove each page as you get the pages back in .doc format.

Just a thought...
that's an idea i'd be willing to explore. because it's over 1,000 pages i would probably put up the scans in 10-page bunches. my main concern is having two people work on the same batch at the same time...of course i would only know until i received both completed submissions.

is there any way to allow uses to post their name next to a particular 10-page scan bundle so that others know that it's been downloaded and being worked on....maybe there's also some way of allowing the same user to post when it was downloaded and/or to update progress, in case s/he decides to drop the project. i can design the website just fine, the problem is this dynamic type of updating that i'm completely unfamiliar with.


=|

1. User posts saying he's willing to help.
2. You send user 10 pages worth of scans
3. User sends typed pages back to you
4. You check off that section as done.
5. If user never sends it back to you, you send it to someone else.

Having everything in .txt format would make it easier to combine the work of multiple people. With .doc you'd have to copy and paste everything into one big document, with text files you could just write a quick script to concatenate them all into 1 file.


Yeah! It's called "cat /mydirectory/* >> newfile.txt"

Of course you'd need to install a port of your favorite UNIX shell commands to work with DOS. They're out there if you look around.. ;)
 

Lonyo

Lifer
Aug 10, 2002
21,938
6
81
Originally posted by: theNEOone
Originally posted by: sygyzy
Stop spewing the OCR crap here. OCR software, at least stuff that the OP can afford, does not convert handwritten text. It's unbelievable the amount of misinformation that you guys like to spread. You are not being helpful. Hire someone to type it or type it yourself.
i too was under the impression that OCR software was soley for typed text on paper that you can scan and convert to editable text.

however, tablet PCs allow handwritten input and OneNote (which is pretty cheap) can convert your handwriting to text very reliably...even cursive writing. i used to own a tablet and was suprised at how accurate the conversion was. you can even have two or three different types of handwriting in a single sentence and it works great.


=|
I was basing it on the OCR software I downloaded a couple of days ago that gave the option for hand written or typed text to be converted, so I assumed it would (try and) convert hand written text as well as printed stuff.
But I didn't investigate it in the end since it didn't do what was required for block text.