The problem is your home page, not robots.txt. When / is requested, the
following is served back. Notice the JavaScript redirect (the full file is
below):
----
function invokeWebApp() {
top.location.href = "http://www.theuniquepear.com/unique/index.jsp";
}
----
Search engines do not execute JavaScript, and there are no links on the page,
so search engines have nowhere to go (except to someone else's site).
As much as I detest SEO companies, you might find it helpful to search for
one for some assistance.
<html>
<head>
<head>
<title>The Unique Pear | Unique Home Decor & Accessories</title>
<meta name="description" content="The Unique Pear is an
online boutique specializing in home decor &
accessories. Products include clocks, candles, wall
decor, garden, lighting, bath and more.">
<meta name="keywords" content="The Unique Pear Timework clocks, lamps,
lamp shades, candles, aroma, aroma difuser, wall decor,
wall scounces, wrought iron, pitchers, bookstands, jaqua
bath products, candleholders">
<meta name="description" content="">
<meta name="keywords" content="">
</head>
<body bgcolor="#FFFFFF">
<script language = "javascript">
//<!--
function invokeWebApp() {
top.location.href = "http://www.theuniquepear.com/unique/index.jsp";
}
invokeWebApp();
// -->
</script>
hello
</body>
</html>
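To make the point concrete, here is a rough sketch in JavaScript (run it with Node.js) of what a crawler that ignores scripts can follow on the page above. The helper name visibleLinks and the "Enter the store" link are made up for illustration, and the regular expressions are a simplification; real crawlers parse HTML far more carefully.

```javascript
// Hypothetical helper: lists the href targets a script-ignoring crawler could follow.
function visibleLinks(html) {
  // Drop <script> blocks entirely, since this sketch assumes they are never run.
  const withoutScripts = html.replace(/<script\b[^>]*>[\s\S]*?<\/script\s*>/gi, "");
  const links = [];
  // Match href="..." or href='...' inside <a ...> tags.
  const anchor = /<a\b[^>]*\bhref\s*=\s*(["'])(.*?)\1/gi;
  let match;
  while ((match = anchor.exec(withoutScripts)) !== null) {
    links.push(match[2]);
  }
  return links;
}

// Trimmed copy of the body served for "/".
const currentPage = [
  '<body bgcolor="#FFFFFF">',
  '<script language = "javascript">',
  'function invokeWebApp() {',
  'top.location.href = "http://www.theuniquepear.com/unique/index.jsp";',
  '}',
  'invokeWebApp();',
  '</script>',
  'hello',
  '</body>',
].join("\n");

// Same page with one plain link added (an assumed fix, not the site's real markup).
const withLink = currentPage.replace(
  "hello",
  '<a href="/unique/index.jsp">Enter the store</a>'
);

console.log(visibleLinks(currentPage)); // []
console.log(visibleLinks(withLink));    // [ '/unique/index.jsp' ]
```

The first call finds nothing to follow, which matches what the access log shows: the bot fetches / and stops. A single ordinary link on the page (or a server-side redirect in place of the script) would give it somewhere to go.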
-Tim
Scott Purcell wrote:
I have had trouble getting search engines to see my site. I built it with
Struts, and use some tags in the index.html page to reach the business logic
and finally get to my page. The URL is http://www.theuniquepear.com
Anyway, when I talked to some co-workers, they suggested I watch my access log
so I can see what files the search engines are indexing. I believe I have the
access log turned on for the site, and I can see when someone hits my web site,
but as far as the search bots go, this is all I see in my logs each day:
$ cat localhost_access_log.2006-02-07.txt | less
67.15.16.30 - - [07/Feb/2006:03:44:55 -0600] "GET /robots.txt HTTP/1.0" 404 985
67.15.16.30 - - [07/Feb/2006:03:46:21 -0600] "GET / HTTP/1.0" 200 844
67.15.16.30 - - [07/Feb/2006:03:51:57 -0600] "GET /robots.txt HTTP/1.0" 404 985
62.114.208.233 - - [07/Feb/2006:03:52:42 -0600] "GET /unique/welcome.do?OVRAW=home%20decorating%20ideas&OVKEY=home
62.114.208.233 - - [07/Feb/2006:03:52:44 -0600] "GET /unique/includes/siteWide.css HTTP/1.1" 200 15402
62.114.208.233 - - [07/Feb/2006:03:52:44 -0600] "GET /unique/images/header_pear.jpg HTTP/1.1" 200 11227
I see the entry for robots.txt, but I have no idea where they are going, or
what they are doing.
I turned on the access log in server.xml like so:
<Valve className="org.apache.catalina.valves.AccessLogValve"
directory="logs" prefix="localhost_access_log." suffix=".txt"
pattern="common" resolveHosts="false"/>
The log snippet above comes from that configuration.
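One way to see where each visitor goes is to group the log lines by client address. The following is only a sketch in JavaScript (run it with Node.js): it assumes the "common" pattern from the Valve above, the function names parseLine and requestsByClient are invented for this example, and lines that are wrapped or cut off are simply skipped.

```javascript
// Hypothetical helper: parses one line written with the "common" access log pattern.
function parseLine(line) {
  const m = /^(\S+) \S+ \S+ \[([^\]]+)\] "(\S+) (\S+)[^"]*" (\d{3}) (\S+)$/.exec(line);
  if (m === null) return null; // wrapped or truncated line, ignore it
  return { client: m[1], time: m[2], method: m[3], path: m[4], status: Number(m[5]) };
}

// Groups "status path" strings under the client address that requested them.
function requestsByClient(lines) {
  const byClient = new Map();
  for (const line of lines) {
    const entry = parseLine(line.trim());
    if (entry === null) continue;
    if (!byClient.has(entry.client)) byClient.set(entry.client, []);
    byClient.get(entry.client).push(entry.status + " " + entry.path);
  }
  return byClient;
}

// The three complete search bot lines from the log excerpt.
const sample = [
  '67.15.16.30 - - [07/Feb/2006:03:44:55 -0600] "GET /robots.txt HTTP/1.0" 404 985',
  '67.15.16.30 - - [07/Feb/2006:03:46:21 -0600] "GET / HTTP/1.0" 200 844',
  '67.15.16.30 - - [07/Feb/2006:03:51:57 -0600] "GET /robots.txt HTTP/1.0" 404 985',
];

for (const [client, requests] of requestsByClient(sample)) {
  console.log(client, requests);
}
```

For 67.15.16.30 this lists only /robots.txt and /, so the bot asks for the home page and then requests nothing further from the site.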