Do the PDF files that the IFilter will index have to be created from a
Word document or other application (that is, already electronic and then
printed to a PDF), or can you scan a paper document into a PDF file and have
its contents indexed?

Thanks for the help

Tim Duell
Buckeye Shapeform

Re: PDF Ifilter Question by David

David
Thu Nov 18 08:13:03 CST 2004

AFAIK the PDF has to be stored in a document library (SQL Server 2000
required), period.

RE: PDF Ifilter Question by englantilainen

englantilainen
Fri Nov 19 01:19:01 CST 2004

David is right: once you have added a PDF IFilter to your WSS system, the
WSS search function will only search PDF documents that are stored in
document libraries in your WSS system (the contents of attachments, for
instance, are not searched by the WSS search).

As far as I know, it does not matter *how* the PDF file is created. I create
the PDF version of the WSS FAQ from a Word document using a free Word-to-PDF
program, and its contents are included in searches on the WSS FAQ site in
the US.

Mike Walsh Helsinki Finland
WSS FAQ wss.collutions.com

Re: PDF Ifilter Question by John

John
Sun Nov 21 11:53:15 CST 2004

In general, a scanned document contains only an image, so there is no text
content in the PDF to index. One would need to run OCR on the scanned image
in some fashion to capture the text content.

PDF files created from word processing documents contain text and formatting
instructions, so the text is indexed by the IFilter.
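
To check which kind of PDF you have before uploading it, a rough test is to
look for font resources in the file, since text on a page needs a font while
a plain scan normally carries only images. The Python sketch below is only a
heuristic under stated assumptions: the function name classify_pdf is made up
for illustration, it has nothing to do with how the IFilter itself works, and
it assumes the PDF's dictionaries are not packed into compressed object
streams (in which case it reports "unknown").

```python
import re


def classify_pdf(pdf_bytes: bytes) -> str:
    """Guess whether a PDF carries indexable text (hypothetical helper).

    Returns "text", "image-only", or "unknown". This is a crude byte scan,
    not a real PDF parser.
    """
    # Text drawn on a page needs a font, so text-based PDFs normally contain
    # "/Font" entries in their resource dictionaries.
    has_font = b"/Font" in pdf_bytes

    # Scanned pages are stored as image XObjects.
    has_image = re.search(rb"/Subtype\s*/Image", pdf_bytes) is not None

    if has_font:
        return "text"
    if has_image:
        return "image-only"
    # Assumption: nothing recognizable means the dictionaries are probably
    # compressed, so no guess is made.
    return "unknown"
```

To try it, read a file in binary mode and pass the bytes in, for example
classify_pdf(open("scan.pdf", "rb").read()), where scan.pdf is a placeholder
name. A scan that has already been through OCR usually carries a hidden text
layer with fonts, so it reports "text", which matches the fact that it has
text to index.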

jlm
