I have been researching everywhere and I just cannot seem to find out
how to make my documents searchable on Google without adding them
directly to my website. Here is the my goal... if someone could help
me to achieve it I would so GREATLY APPRECIATE it... as I am a bit
frustrated at this point.
Goal:
I need to make my document library searchable by Google. I've gone
down the route of using SharePoint to hold my documents but this has
been ruled out as we need a user name and password in order for you
retrieve the documents hence Google cannot get in. So we changed
routes and decided to use MONGO db which so far I am still not able to
get my PDF to be searchable ( mongo requires no login credentials,
however it is not on the DMZ, it lives behind the firewall but there
is a port to allow access into it...
Steps. 1st I am using ASP.net C#
2nd I have a pdf in mongo that is password free and Google accessible
3rd I have a site map that I’ve submitted to Google with the page
these pdfs are linked on.
4th. I am using a linkbutton to access the files from Mongo and from
one that is directly to a pdf within my site.
So far there here are results and details:
I have 4 test cases...
Case 1: PDF in mongo with the text in the pdf which seems to be only
select all (locked down) retrieving via a linkbutton ( this one is not
searchable)
Mongo pdf
<asp:LinkButton ID="lbClick" runat="server"
onclick="lbClick_Click">Click Me</asp:LinkButton>
Case 2: PDF with the text in the pdf able to copy any part of the
content being retrieve via a linkbutton (this one is now searchable on
Google, but is not an option for my site to house all the PDF’s within
the website, it was just a test to make sure we can search a pdf)
PDF on site
<asp:LinkButton ID="lbnOnSite" runat="server"
onclick="lbnOnSite_Click">Click Here</asp:LinkButton>
Case 3: PDF with text in the pdf able to copy any part of the content
being retrieved via an <a> tag ( this one oddly is not searchable and
I am very confused why, I would of thought if any this one would work
no problem after everything I have read)
test searchable pdf
<a href="/endobiop.pdf"> Click Me</a>
Case 4: PDF in Mongo text in the pdf able to copy any part of the
content being retrieved via a linkbutton. ( this one is not
searchable)
Mongo Test 2
<asp:LinkButton ID="LinkButton1" runat="server"
onclick="LinkButton1_Click">Click Me</asp:LinkButton>
The only other thing I would like to add is that out of all the
research I’ve done it seems as if it takes time on Google to do the
indexing of the files and that the PDF needs to be opened once on the
site… So the question I have is how do you have to wait? 1 week, 1
month, 1 year, a lifetime… basically I have no idea when I know my
test failed.
--
You received this message because you are subscribed to the Google
Groups "Google AJAX APIs" group.
To post to this group, send email to
[email protected]
To unsubscribe from this group, send email to
[email protected]
To view this message on the web, visit
http://groups.google.com/group/google-ajax-search-api?hl=en_US
For more options, visit this group at
http://groups.google.com/group/google-ajax-search-api?hl=en?hl=en