Problems to access downloads, install or configure a module? Remarks, bugs or suggestions? We treat your request within 24 hours.
Thanks for your feedback!
Further to:
This search module indexes modules (not documents). Therefore, your pdfs need to be exposed to indexation via modules such as Repository or Documents. You will then select in LuceneSearch settings to index Repository and Document modules installed on your portal. Keep us posted, we are here to help
How do I expose pdf documents in "Documents" or "Repository" modules to Lucene indexing. Step by step pointer please. Thank you. The module is excellent.
Hello,
On the module settings, then Result Settings > Index Settings > Indexing Content Settings, verify that your module is checked in “Indexed Modules”. (Documents for instance)
Then, in Results Settings, you have to check if you have “Text” checked in “Search Fields”. If the module Document wasn’t checked in the previous step, you should re-index before seeing this field. The “Text” field match the pdf content.
You can use the tool Luke to analyze what is indexed. http://www.getopt.org/luke/ (Use the java webstart version).
Regards,
Aurélien Catinon
Hi Aurelien,
Thanks for the pointers. I cannot see a "Text" field even after checking just "Documents". Any idea why. So close...
I have added a pdf file to a "documents" module on another page but cannot get its content indexed in Lucene.The "text" field is
eluding me. Please help.
Thank you
Andrew
Hi,
Have you tried to re-index ?
In Host > Search admin, click on "Re-index content", then go back to lucene settings. The "Text" field should apears. If not, check the event logs for an eventual exception.
Aurélien
I'm having the same problem with setting this up. I've reindexed and see no "Text" field.
I get Author Name, Page Title, Page Name, Title, Parent Page Title, Description, Indexed Date, Page Description, Published Date, Parent Page Name, PubHistory, Module Title, Content.
In "Indexed Modules" I have checked off Blog, Documents, Events Calendar, Gallery, and HTML Pro
Is it possible that because we're using HTML Pro (Telerik Editor) instead of the FCK Editor that this is happening?
Thanks,
Michael
No idea?
We are working on a new release of Aricie.LuceneSearch Module. We'll get back to you asap.
@i.senonder : Notice that the document indexation is working only if yours documents are in a document module like Documents or Repository.
Thanks for your patience Best regards.
the new release of Aricie - LuceneSearch Module is available in the download section. The PDf indexing process is unchanged.
- Check documents or repository module for indexing
- re-index content (the document will be shown in the all results list already)
- Check the "text" field for activate search on the content of the PDF.
A new functionnality is available : a stand alone document indexation
In the indexer settings, you can enabled the sand alone documents provider. When enabled, you can select the folder and files extension you want index.
Note : For Excel or Words indexing, you may have installed the IFlter for Office Documents on your webserver. This components is available here : http://www.microsoft.com/downloads/en/details.aspx?FamilyId=60C92A37-719C-4077-B5C6-CAC34F4227CC&displaylang=en
Best regards,
DotNetNuke Modules | Licence | Download | Subscribe Now | Support | Contact