Register
Forgot Password ?

 

Got a question for us?

Get help from our team within 24 hours

Problems to access downloads, install or configure a module? Remarks, bugs or suggestions? We treat your request within 24 hours.

Thanks for your feedback!

 
  Aricie  Expert Modules  LuceneSearch  PDF Indexing
Previous Previous
 
Next Next
New Post 8/30/2010 7:01 AM
Unresolved
User is offline Andrew
2 posts
No Ranking


PDF Indexing 

Further to:

This search module indexes modules (not documents). Therefore, your pdfs need to be exposed to indexation via modules such as Repository or Documents. You will then select in LuceneSearch settings to index Repository and Document modules installed on your portal. Keep us posted, we are here to help

How do I expose pdf documents in "Documents" or "Repository" modules to Lucene indexing. Step by step pointer please. Thank you. The module is excellent.

 
New Post 8/30/2010 10:57 AM
User is offline Aurélien Catinon
17 posts
No Ranking


Re: PDF Indexing 

Hello,

On the module settings, then Result Settings > Index Settings > Indexing Content Settings, verify that your module is checked in “Indexed Modules”. (Documents for instance)

Then, in Results Settings, you have to check if you have “Text” checked in “Search Fields”. If the module Document wasn’t checked in the previous step, you should re-index before seeing this field. The “Text” field match the pdf content.

You can use the tool Luke to analyze what is indexed. http://www.getopt.org/luke/ (Use the java webstart version).

Regards,

Aurélien Catinon

 
New Post 8/31/2010 2:32 AM
User is offline Andrew
2 posts
No Ranking


Re: PDF Indexing 

Hi Aurelien,

Thanks for the pointers. I cannot see a "Text" field even after checking just "Documents". Any idea why. So close...

I have added a pdf file to a "documents" module on another page but cannot get its content indexed in Lucene.The "text" field is

eluding me. Please help.

Thank you

 

Andrew

 
New Post 8/31/2010 10:42 AM
User is offline Aurélien Catinon
17 posts
No Ranking


Re: PDF Indexing 

Hi,

Have you tried to re-index ?

In Host > Search admin, click on "Re-index content", then go back to lucene settings. The "Text" field should apears. If not, check the event logs for an eventual exception.

Regards,

Aurélien

 
New Post 9/8/2010 6:42 PM
User is offline mike mack
2 posts
No Ranking


Re: PDF Indexing 

 Hi Aurelien,

I'm having the same problem with setting this up.  I've reindexed and see no "Text" field. 

I get Author Name, Page Title, Page Name, Title, Parent Page Title, Description, Indexed Date, Page Description, Published Date, Parent Page Name, PubHistory, Module Title, Content.

In "Indexed Modules" I have checked off Blog, Documents, Events Calendar, Gallery, and HTML Pro

Is it possible that because we're using HTML Pro (Telerik Editor) instead of the FCK Editor that this is happening?

Thanks,

Michael

 
New Post 9/13/2010 7:09 PM
User is offline mike mack
2 posts
No Ranking


Re: PDF Indexing 

No idea?

 
New Post 2/23/2011 12:51 PM
User is offline i.senonder
3 posts
No Ranking


Re: PDF Indexing 
Hi, i am experiencing the same issue here. But not only with pdfs. I installed the module to my site and added it to my page, activated it succesfully. I re-indexed my site in host>search admin panel and then, i made some searches. There are two html modules in another page, one has some sentences written in it, other has a word file (.doc) inserted. When i search the words in first module, LuceneSearch finds it successfully, there isn't any problem here. But when i search a word inside the .doc file in the second html module, it doesn't show any results. I am not very experienced in this issue so can someone explain me what i'm doing wrong? Thanks in advance, regards.
 
New Post 3/3/2011 11:27 AM
User is offline cgaspard
106 posts
10th Level Poster


Re: PDF Indexing 

 Hi,

We are working on a new release of Aricie.LuceneSearch Module. We'll get back to you asap.

 

@i.senonder : Notice that the document indexation is working only if yours documents are in a document module like Documents or Repository.

 

Thanks for your patience
Best regards.


Célian GASPARD
Développeur
Société de conseil et de service en informatique et systèmes d'information
 
New Post 3/28/2011 10:08 AM
User is offline cgaspard
106 posts
10th Level Poster


Re: PDF Indexing 
Modified By cgaspard  on 3/28/2011 9:31:32 AM)

  Hi,

the new release of Aricie - LuceneSearch Module is available in the download section. The PDf indexing process is unchanged.

- Check documents or repository module for indexing

- re-index content (the document will be shown in the all results list already)

- Check the "text" field for activate search on the content of the PDF.

 

A new functionnality is available : a stand alone document indexation

In the indexer settings, you can enabled the sand alone documents provider. When enabled, you can select the folder and files extension you want index.

Note : For Excel or Words indexing, you may have installed the IFlter for Office Documents on your webserver. This components is available here :
http://www.microsoft.com/downloads/en/details.aspx?FamilyId=60C92A37-719C-4077-B5C6-CAC34F4227CC&displaylang=en

 

Best regards,
 

 


Célian GASPARD
Développeur
Société de conseil et de service en informatique et systèmes d'information
 
Previous Previous
 
Next Next
  Aricie  Expert Modules  LuceneSearch  PDF Indexing