[
https://issues.apache.org/jira/browse/NUTCH-1098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Kolar updated NUTCH-1098:
---
Attachment: (was: patch-with-utf8-encoding.diff)
> better url-normalizer basic
>
[
https://issues.apache.org/jira/browse/NUTCH-1070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Kolar updated NUTCH-1070:
---
Attachment: (was: bash.c)
> Run nutch under native windows (no cygwin)
>
[
https://issues.apache.org/jira/browse/NUTCH-1070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Kolar updated NUTCH-1070:
---
Attachment: (was: chmod.c)
> Run nutch under native windows (no cygwin)
> ---
[
https://issues.apache.org/jira/browse/NUTCH-1070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Kolar updated NUTCH-1070:
---
Attachment: (was: nutch.bat)
> Run nutch under native windows (no cygwin)
> -
[
https://issues.apache.org/jira/browse/NUTCH-1194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Kolar updated NUTCH-1194:
---
Comment: was deleted
(was: locking should be done in setup/cleanup task. Currently if you kill
proce
[
https://issues.apache.org/jira/browse/NUTCH-1098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Kolar updated NUTCH-1098:
---
Attachment: patch-with-utf8-encoding.diff
Added support for encoding string to UTF-8 and then URL %es
[
https://issues.apache.org/jira/browse/NUTCH-1098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Kolar updated NUTCH-1098:
---
Attachment: (was: patch-urlnormalizer.diff)
> better url-normalizer basic
> -
[
https://issues.apache.org/jira/browse/NUTCH-1098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Kolar updated NUTCH-1098:
---
Attachment: patch-urlnormalizer.diff
Do not decode # and / characters during %XX decoding. Unit tests
[
https://issues.apache.org/jira/browse/NUTCH-1098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Kolar updated NUTCH-1098:
---
Attachment: (was: patch-urlnormalizer.diff)
> better url-normalizer basic
> -
[
https://issues.apache.org/jira/browse/NUTCH-1098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Kolar updated NUTCH-1098:
---
Attachment: (was: nutch.diff)
> better url-normalizer basic
> ---
>
>
[
https://issues.apache.org/jira/browse/NUTCH-1098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Radim Kolar updated NUTCH-1098:
---
Attachment: patch-urlnormalizer.diff
> better url-normalizer basic
> ---
11 matches
Mail list logo