You need to sign in to do that
Don't have an account?

Parsing of Word/PDF documents
I need to automate our new candidate creation process for candidates that we have resumes for. We would like to parse name, address,email,phone information from these Word/PDF documents and then pre-populate the relevant salesforce fields when creating new candidates.
Kindly advice me on the best approach to achieve this.
A 100% salesforce solution, I dont think I could provide at this time.
If you don't mind a system to system architecture, you could use Apache POI or Apache TIKA to extract the content from the resumes, into a local DB or even text file.
Then, send that data over to salesforce via web services API.
2 steps.
This way, you don't re-invent the wheel when parsing the documents in salesforce.
@Sayasoni - were you able to find 100% force.com (salesforce) solution?
Thanks