solr highlighting like google
up vote
-1
down vote
favorite
I want to make a search engine like Google with Solr, for indexing a pdf. Can I make my search engine same as Google or not ? If yes, I don't know how to do it. I've search, and found this, but not working. I'm too confused what to do first, edit the schema ? Edit the solrconfig
? What code I've to put in PHP and where the line is.
PS : I'm using Linux, Solr 7.5.0
php linux solr highlight
add a comment |
up vote
-1
down vote
favorite
I want to make a search engine like Google with Solr, for indexing a pdf. Can I make my search engine same as Google or not ? If yes, I don't know how to do it. I've search, and found this, but not working. I'm too confused what to do first, edit the schema ? Edit the solrconfig
? What code I've to put in PHP and where the line is.
PS : I'm using Linux, Solr 7.5.0
php linux solr highlight
Start with the highlighting guide in the reference manual. You give these parameters as part of the query string.
– MatsLindh
Nov 22 at 14:29
i've tried like the reference, but in my search result just still the same. i added "&hl=true&hl.fl=content" in the code isn't working. mmm, in the pdf indexing, which fields of json i need to put in the hl.fl? cause all of json metadata didn't showing the file text. i must change the hl.fl=**content** with what metadata?
– adiyaksa kevin
Nov 22 at 18:06
First you'll have to make sure that you're actually extracting the text and keeping it stored when indexing the PDFs. You can see that by making a regular query to retrieve all documents (*:*
), and then see if you have the content there. If not, you might need to add a field namedcontent
to your schema, or usefmap.content=text
to index it to a different field (here namedtext
).
– MatsLindh
Nov 22 at 19:04
add a comment |
up vote
-1
down vote
favorite
up vote
-1
down vote
favorite
I want to make a search engine like Google with Solr, for indexing a pdf. Can I make my search engine same as Google or not ? If yes, I don't know how to do it. I've search, and found this, but not working. I'm too confused what to do first, edit the schema ? Edit the solrconfig
? What code I've to put in PHP and where the line is.
PS : I'm using Linux, Solr 7.5.0
php linux solr highlight
I want to make a search engine like Google with Solr, for indexing a pdf. Can I make my search engine same as Google or not ? If yes, I don't know how to do it. I've search, and found this, but not working. I'm too confused what to do first, edit the schema ? Edit the solrconfig
? What code I've to put in PHP and where the line is.
PS : I'm using Linux, Solr 7.5.0
php linux solr highlight
php linux solr highlight
edited Nov 22 at 16:27
Skynet
4,26452640
4,26452640
asked Nov 22 at 14:12
adiyaksa kevin
63
63
Start with the highlighting guide in the reference manual. You give these parameters as part of the query string.
– MatsLindh
Nov 22 at 14:29
i've tried like the reference, but in my search result just still the same. i added "&hl=true&hl.fl=content" in the code isn't working. mmm, in the pdf indexing, which fields of json i need to put in the hl.fl? cause all of json metadata didn't showing the file text. i must change the hl.fl=**content** with what metadata?
– adiyaksa kevin
Nov 22 at 18:06
First you'll have to make sure that you're actually extracting the text and keeping it stored when indexing the PDFs. You can see that by making a regular query to retrieve all documents (*:*
), and then see if you have the content there. If not, you might need to add a field namedcontent
to your schema, or usefmap.content=text
to index it to a different field (here namedtext
).
– MatsLindh
Nov 22 at 19:04
add a comment |
Start with the highlighting guide in the reference manual. You give these parameters as part of the query string.
– MatsLindh
Nov 22 at 14:29
i've tried like the reference, but in my search result just still the same. i added "&hl=true&hl.fl=content" in the code isn't working. mmm, in the pdf indexing, which fields of json i need to put in the hl.fl? cause all of json metadata didn't showing the file text. i must change the hl.fl=**content** with what metadata?
– adiyaksa kevin
Nov 22 at 18:06
First you'll have to make sure that you're actually extracting the text and keeping it stored when indexing the PDFs. You can see that by making a regular query to retrieve all documents (*:*
), and then see if you have the content there. If not, you might need to add a field namedcontent
to your schema, or usefmap.content=text
to index it to a different field (here namedtext
).
– MatsLindh
Nov 22 at 19:04
Start with the highlighting guide in the reference manual. You give these parameters as part of the query string.
– MatsLindh
Nov 22 at 14:29
Start with the highlighting guide in the reference manual. You give these parameters as part of the query string.
– MatsLindh
Nov 22 at 14:29
i've tried like the reference, but in my search result just still the same. i added "&hl=true&hl.fl=content" in the code isn't working. mmm, in the pdf indexing, which fields of json i need to put in the hl.fl? cause all of json metadata didn't showing the file text. i must change the hl.fl=**content** with what metadata?
– adiyaksa kevin
Nov 22 at 18:06
i've tried like the reference, but in my search result just still the same. i added "&hl=true&hl.fl=content" in the code isn't working. mmm, in the pdf indexing, which fields of json i need to put in the hl.fl? cause all of json metadata didn't showing the file text. i must change the hl.fl=**content** with what metadata?
– adiyaksa kevin
Nov 22 at 18:06
First you'll have to make sure that you're actually extracting the text and keeping it stored when indexing the PDFs. You can see that by making a regular query to retrieve all documents (
*:*
), and then see if you have the content there. If not, you might need to add a field named content
to your schema, or use fmap.content=text
to index it to a different field (here named text
).– MatsLindh
Nov 22 at 19:04
First you'll have to make sure that you're actually extracting the text and keeping it stored when indexing the PDFs. You can see that by making a regular query to retrieve all documents (
*:*
), and then see if you have the content there. If not, you might need to add a field named content
to your schema, or use fmap.content=text
to index it to a different field (here named text
).– MatsLindh
Nov 22 at 19:04
add a comment |
active
oldest
votes
active
oldest
votes
active
oldest
votes
active
oldest
votes
active
oldest
votes
Thanks for contributing an answer to Stack Overflow!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Some of your past answers have not been well-received, and you're in danger of being blocked from answering.
Please pay close attention to the following guidance:
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53432852%2fsolr-highlighting-like-google%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Start with the highlighting guide in the reference manual. You give these parameters as part of the query string.
– MatsLindh
Nov 22 at 14:29
i've tried like the reference, but in my search result just still the same. i added "&hl=true&hl.fl=content" in the code isn't working. mmm, in the pdf indexing, which fields of json i need to put in the hl.fl? cause all of json metadata didn't showing the file text. i must change the hl.fl=**content** with what metadata?
– adiyaksa kevin
Nov 22 at 18:06
First you'll have to make sure that you're actually extracting the text and keeping it stored when indexing the PDFs. You can see that by making a regular query to retrieve all documents (
*:*
), and then see if you have the content there. If not, you might need to add a field namedcontent
to your schema, or usefmap.content=text
to index it to a different field (here namedtext
).– MatsLindh
Nov 22 at 19:04