Re: Advice on which PostgreSQL to use with ManifoldCF 2.6

2017-03-08 Thread Karl Wright
>> Other things you can try: >>>> (1) Running the postgresql LT tests (ant run-LT-postgresql) against >>>> your postgresql installation; you will need to change the test code itself >>>> to allow it to create an instance for testing in that case; >>>

Re: Advice on which PostgreSQL to use with ManifoldCF 2.6

2017-03-08 Thread Karl Wright
may be the best place to start. Turning on > lock debugging will generate a large amount of log output but may be > necessary, especially if we want to figure out if the lock issue is a > downstream problem caused by the database integrity issue. > > > Thanks, > Karl > > >

Re: Advice on which PostgreSQL to use with ManifoldCF 2.6

2017-03-08 Thread Karl Wright
ed, Mar 8, 2017 at 5:11 AM, Standen Guy <guy.stan...@uk.fujitsu.com> wrote: > Hi Karl, > > I have upgraded to MCF 2.6 on Windows using Zookeeper for > synchronisation and PostgreSQL 9.3.16 for the database. > > I re-ran the pair of jobs ( 1 web crawl of local intranet site e

RE: Advice on which PostgreSQL to use with ManifoldCF 2.6

2017-03-06 Thread Standen Guy
Many Thanks Karl From: Karl Wright [mailto:daddy...@gmail.com] Sent: 06 March 2017 09:24 To: user@manifoldcf.apache.org Subject: Re: Advice on which PostgreSQL to use with ManifoldCF 2.6 Hi Guy, (1) I have no experience with PostgreSQL versions beyond 9.3, but I doubt you would have problems

RE: Advice on which PostgreSQL to use with ManifoldCF 2.6

2017-03-06 Thread Standen Guy
as the same user) What could cause these errors? Many Thanks, Guy From: Karl Wright [mailto:daddy...@gmail.com] Sent: 03 March 2017 17:35 To: user@manifoldcf.apache.org Subject: Re: Advice on which PostgreSQL to use with ManifoldCF 2.6 Hi Guy: It is expected that sometimes database deadlock

Re: MS Exchange support

2017-03-05 Thread Karl Wright
Hi Cihad, I've been able to connect to Exchange in the past; you need to use IMAP if I recall correctly. Karl On Sun, Mar 5, 2017 at 11:53 AM, Cihad Guzel wrote: > Hi, > > Does MCF Email connector support Microsoft Exchange? It doesn't support as > much as I can see. > >

RE: Advice on which PostgreSQL to use with ManifoldCF 2.6

2017-03-03 Thread Standen Guy
detail. Any view why this might be happening? Best Regards, Guy From: Karl Wright [mailto:daddy...@gmail.com] Sent: 03 March 2017 11:27 To: user@manifoldcf.apache.org Subject: Re: Advice on which PostgreSQL to use with ManifoldCF 2.6 Hi Guy, An issue with concurrent jobs is known for jobs

Re: Request-URI Too Long Error

2017-03-02 Thread Furkan KAMACI
Hi Karl, Single Solr instance is set and "Use the Extract Update Handler:" is clicked. Also, Update handler is /update/extract Kind Regards, Furkan KAMACI On Thu, Mar 2, 2017 at 3:52 PM, Karl Wright wrote: > Hi Furkan, > > The error is coming from Solr. How is your Solr

Re: Request-URI Too Long Error

2017-03-02 Thread Karl Wright
Hi Furkan, The error is coming from Solr. How is your Solr connection configured? If you are using /update/extract, your documents should be sent via POST, not GET. Karl On Thu, Mar 2, 2017 at 8:24 AM, Furkan KAMACI wrote: > Hi, > > When I test E-mail connector I

Re: [Webcrawler Connector] Feature for ignoring meta/rel robots tags/attributes

2017-02-27 Thread Markus Schuch
raised https://issues.apache.org/jira/browse/CONNECTORS-1392 Cheers, Markus On 26.02.2017 at 00:43, Karl Wright wrote: > I certainly have no objection. I would recommend, however, that the > default setting of this configuration option be set to "follow > metadata/rel", and that the

Re: Metadata adjuster

2017-02-22 Thread Karl Wright
Ok, I added a bit of extra info to the null output connector document ingestion simple history logging on trunk. This extra info summarizes attributes and their counts. I then created a job, with an attribute and fixed value, and left all other defaults in place. The output looks like this:

Re: Metadata adjuster

2017-02-22 Thread Karl Wright
Ah, sorry once again. It is definitely the update/extract handler in the log entry you sent. I am quite busy at the moment and will review this evening further. Thanks, Karl On Wed, Feb 22, 2017 at 11:21 AM, Karl Wright wrote: > Hi Marisol, > > The [INFO] log statement

Re: Metadata adjuster

2017-02-22 Thread Karl Wright
Hi Marisol, The [INFO] log statement you sent earlier was not an /update/extract request, and your Solr connection is set up to send to the Solr Cell /update/extract endpoint. Can you look again in your logs and find the *right* [INFO] statement? Thanks!! Karl On Wed, Feb 22, 2017 at 10:52

Re: Metadata adjuster

2017-02-22 Thread Karl Wright
Ah, never mind -- I need you instead to view the Solr connection, and paste that in an email. Basically, I want to be sure you are not inadvertently disabling metadata to Solr. Thanks, Karl On Wed, Feb 22, 2017 at 10:39 AM, Karl Wright wrote: > This is how the email

Re: Metadata adjuster

2017-02-22 Thread Karl Wright
This is how the email appears here: >> 4. Bottom of Form Marisol Redondo Email: mredo...@revenue.ie Phone: 35428 Please note that Revenue cannot guarantee that any personal and sensitive data, sent in plain text via standard email, is fully secure. Customers who choose

Re: Metadata adjuster

2017-02-22 Thread Karl Wright
Hi Marisol, The [INFO] log entries indicate that your document has almost no metadata at all. But the Metadata Adjuster transformation connector is designed to do exactly what you want. Can you view your job, and cut and paste the View Job page into an email, so I can see how your metadata

Re: Metadata adjuster

2017-02-22 Thread Marisol Redondo
Hi Karl and thank you for this quick answer. I was reading the documentation of MCF 1.10 but I'm using MCF 2.5, sorry for the confusion, and I think this version is compatible with solr6. The pdf doesn't have any metadata or field called facetContentType, this is because I'd been trying to use

Re: Additional information from external database

2017-02-22 Thread Karl Wright
Hi Wilhelm, Documents that come from the file system connector have a URL that includes the file name, so you should have a way of finding the file name in your connector. There is also a RepositoryDocument file name field that you can get, and I believe that too will be set. Accessing

Re: Metadata adjuster

2017-02-21 Thread Karl Wright
Hi Marisol, Can you find the [INFO] entry in the Solr log for this document? That should help clear up any confusion. Also, for what it is worth, MCF 1.10 is not using a SolrJ that is up to date with Solr 6.x. That could be the source of the problem. Is there any reason you are using a 1.x

Re: extract email attachment

2017-02-09 Thread Cihad Guzel
Thanks Karl. Regards, Cihad Guzel 2017-02-09 16:27 GMT+03:00 Karl Wright : > Hi Cihad, > The comparison should have been: > > mp.getCount() <= attachmentNumber > > As for changing ":" to "/", the real problem is that these should all be > ":"'s, including line 678. My

Re: extract email attachment

2017-02-09 Thread Karl Wright
Hi Cihad, The comparison should have been: mp.getCount() <= attachmentNumber As for changing ":" to "/", the real problem is that these should all be ":"'s, including line 678. My apologies. I've committed the changes. Thanks, Karl On Thu, Feb 9, 2017 at 8:15 AM, Cihad Guzel
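The corrected comparison can be illustrated with a minimal sketch. It assumes that attachmentNumber is a zero-based index into the message parts and uses a plain integer in place of mp.getCount(); the class and method names are hypothetical and are not taken from EmailConnector.

```java
public class AttachmentBoundsSketch {
  // Hypothetical helper: true when attachmentNumber does not refer to an
  // existing part, i.e. the part count is <= the zero-based index.
  public static boolean isOutOfRange(int partCount, int attachmentNumber) {
    return partCount <= attachmentNumber;
  }

  public static void main(String[] args) {
    int partCount = 2; // the mp.getCount() value reported in this thread
    for (int attachmentNumber = 0; attachmentNumber <= 2; attachmentNumber++) {
      System.out.println("attachmentNumber=" + attachmentNumber
          + " out of range: " + isOutOfRange(partCount, attachmentNumber));
    }
  }
}
```

With a count of 2, indexes 0 and 1 are in range and only index 2 is rejected. The earlier ">=" comparison would have rejected 0 and 1 as well, which matches the values reported in the thread.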

Re: extract email attachment

2017-02-09 Thread Cihad Guzel
Hi Karl, mp.getCount() is 2 and attachmentNumber is '0' or '1' in my case. Regards, Cihad Guzel 2017-02-09 16:07 GMT+03:00 Cihad Guzel : > Hi Karl, > > I made some changes in the code and then the indexing was done > successfully. > > The changes are as follows: > > I have

Re: extract email attachment

2017-02-09 Thread Cihad Guzel
Hi Karl, I made some changes in the code and then the indexing was done successfully. The changes are as follows: I have removed these lines (lines: 772-775): if (mp.getCount() >= attachmentNumber) { activities.deleteDocument(documentIdentifier);

Re: extract email attachment

2017-02-07 Thread Cihad Guzel
Hi Karl, Doesn't the 'else' part have to be processed when the email has an attachment? Although the email has an attachment, only the first part was processed. Also, I don't see the attachment's content in the Solr index. I edited the code line for testing as follows: if (attachmentIndex == null) {

Re: extract email attachment

2017-02-07 Thread Karl Wright
Here's the full code for this class: https://svn.apache.org/repos/asf/manifoldcf/trunk/connectors/email/connector/src/main/java/org/apache/manifoldcf/crawler/connectors/email/EmailConnector.java Karl On Tue, Feb 7, 2017 at 5:14 PM, Karl Wright wrote: > Hi Cihad, > > The

Re: extract email attachment

2017-02-07 Thread Karl Wright
Hi Cihad, The variable attachmentIndex is *supposed* to be null except when an attachment is being processed. The code should look like this: if (attachmentIndex == null) { // It's an email ... } else { // It's an attachment attachmentNumber =

Re: extract email attachment

2017-02-07 Thread Cihad Guzel
Hi Karl, I added a LOG line for testing. It looks like attachmentIndex is null. 2017-02-08 0:11 GMT+03:00 Karl Wright : > I attached a second patch (to apply on top of the first patch). Please > let me know if that fixes the issue. > > Karl > > > On Tue, Feb 7, 2017 at 3:59 PM,

Re: extract email attachment

2017-02-07 Thread Cihad Guzel
Hi Karl, I get the following error: FATAL 2017-02-07 23:56:09,483 (Worker thread '29') - Error tossed: For input string: "myFolder/test:" java.lang.NumberFormatException: For input string: "myFolder/test:< cadngpdgsxhewo0gdnul6s2sogusxua9mx2wxot23wi37hog...@mail.gmail.com>" at

Re: extract email attachment

2017-02-07 Thread Cihad Guzel
Thanks Karl, I will try it. Regards Cihad Guzel 2017-02-07 22:36 GMT+03:00 Karl Wright : > I've created a ticket and attached a patch to it. CONNECTORS-1375. > Please let me know if it works for you; if not, I'll fix what doesn't work. > > Karl > > > On Tue, Feb 7, 2017 at

Re: extract email attachment

2017-02-07 Thread Karl Wright
Correction: the only metadata attribute we set is the attachment(s) mimetype (as a multivalued field) -- this doesn't currently include the attachment data. Karl On Tue, Feb 7, 2017 at 1:14 PM, Karl Wright wrote: > Hi Cihad, > > The email connector is providing the

Re: extract email attachment

2017-02-07 Thread Karl Wright
Hi Cihad, The email connector is providing the attachment data unextracted to the output connector as metadata attribute data. There are no transformation connectors that look at this metadata. Solr cell also probably does not handle binary in random metadata attributes the proper way. The

Re: Notice: A bad bug with Solr Connector metadata has been found and fixed

2017-01-30 Thread Piergiorgio Lucidi
Hi Karl, thank you so much for your message. I hope to spend some time on testing other connectors. I'm actually involved in the development of the new CMIS Output Connector. Great job! Cheers, Piergiorgio 2017-01-25 18:38 GMT+01:00 Karl Wright : > Hi all, > > Just fixed

Re: PKIX error, when using https URL in RSS Connection

2017-01-26 Thread Karl Wright
Hi Joachim, The RSS connector by default should use "trust everything", which is why there's no selection for that in the UI. The code clearly has support for this in place. The only way it would not work is if the https connection you are trying to set up requires public key authentication, or

Re: CMIS Authority Test

2017-01-23 Thread Furkan KAMACI
Hi, Problem arises from here: When I check this URL: http://[host]:[port]/alfresco/cmisbrowse?url=http:// [host]:[port]/alfresco/cmisatom It says that: cmisVersionSupported 1.0 However, I tried the URL you mentioned and I can access them :) First of all, should I replace -default- with my

Re: CMIS Authority Test

2017-01-22 Thread Piergiorgio Lucidi
Unfortunately the outstanding implementation of CMIS in Alfresco can cause some doubts about which URL to use :) As Jeff kindly mentioned again, you should use CMIS 1.1 with a URL similar to the following: http://[host]:[port]/alfresco/api/-default-/public/cmis/versions/1.1/atom

Re: CMIS Authority Test

2017-01-21 Thread Jeff Potts
That URL is deprecated in Alfresco 4.2.d. The correct URL can be found below: CMIS 1.0 For Alfresco 3.x : http://[host]:[port]/alfresco/service/cmis For Alfresco 4.0.x, Alfresco 4.1.x and Alfresco 4.2.a-c: http://[host]:[port]/alfresco/cmisatom For Alfresco 4.2.d-f, Alfresco 5.0 and Alfresco

Re: CMIS Authority Test

2017-01-18 Thread Piergiorgio Lucidi
Hi Furkan, the endpoint depends on the specific Alfresco version. For Alfresco 4.2.d-f, Alfresco 5.0 and Alfresco 5.1: http://[host]:[port]/alfresco/api/-default-/public/cmis/versions/1.1/atom For Alfresco <=4.1: http://[host]:[port]/alfresco/cmisatom For Alfresco 3:

Re: email connector filtering

2017-01-16 Thread Cihad Guzel
Hi Karl, I looked at the email connector code, but the "date" is not being used for filtering. I think it should be added. I created an issue: https://issues.apache.org/jira/browse/CONNECTORS-1368 and created a PR on GitHub: https://github.com/apache/manifoldcf/pull/15 Regards Cihad Guzel

Re: Fw: Questions about the behaviour of a custom Manifoldcf repository connector

2017-01-13 Thread Karl Wright
Hi Vigi, For a description of the internals of ManifoldCF, you will want to read ManifoldCF In Action, available for free here: https://github.com/DaddyWri/manifoldcfinaction/tree/master/pdfs In order to model difficulty talking to a repository, what you want to do is throw a

Re: Documentum Aspect Attribute

2017-01-12 Thread Karl Wright
ary 11, 2017 8:55 PM > *To:* user@manifoldcf.apache.org > *Cc:* Sharnel Merdeck Pereira > *Subject:* Re: Documentum Aspect Attribute > > > > Hi Tamizh, > > > > For (1), can you please describe how you access "aspect attributes" using > DFC.

RE: Documentum Aspect Attribute

2017-01-11 Thread Tamizh Kumaran Thamizharasan
, January 11, 2017 8:55 PM To: user@manifoldcf.apache.org Cc: Sharnel Merdeck Pereira Subject: Re: Documentum Aspect Attribute Hi Tamizh, For (1), can you please describe how you access "aspect attributes" using DFC. For (2) I don't understand the problem. Thanks, Karl On Wed, Jan 11,

Re: Documentum Aspect Attribute

2017-01-11 Thread Karl Wright
Hi Tamizh, For (1), can you please describe how you access "aspect attributes" using DFC. For (2) I don't understand the problem. Thanks, Karl On Wed, Jan 11, 2017 at 10:08 AM, Tamizh Kumaran Thamizharasan < tthamizhara...@worldbankgroup.org> wrote: > Hi Karl, > > > > We are able to index

Re: [ANNOUNCE] Apache ManifoldCF 2.6 has now been released!

2016-12-31 Thread Rafa Haro
Well done team! On Sat, 31 Dec 2016 at 14:40, Piergiorgio Lucidi < piergior...@apache.org> wrote: > Great work guys! > > Thank you so much for these contributions. > Happy New Year to all the team and all the ManifoldCF users :) > > Piergiorgio > > 2016-12-30 21:08 GMT+01:00 Karl

Re: [ANNOUNCE] Apache ManifoldCF 2.6 has now been released!

2016-12-31 Thread Piergiorgio Lucidi
Great work guys! Thank you so much for these contributions. Happy New Year to all the team and all the ManifoldCF users :) Piergiorgio 2016-12-30 21:08 GMT+01:00 Karl Wright : > This release contains a new Slack notification connector, as well as > numerous bug fixes.

Re: Sharepoint get ACL

2016-12-30 Thread Karl Wright
Also, FWIW, I can see that the user logins are in fact native and are in claims-based form, so your sharepoint is *definitely* configured to be claims-based. Thanks, Karl On Fri, Dec 30, 2016 at 8:52 AM, Karl Wright wrote: > Hi Cihad, > > I am almost certain your

Re: Sharepoint get ACL

2016-12-30 Thread Cihad Guzel
Hi Karl, I have changed the authority group to Native SharePoint instead of Active Directory, and I can see the allow tokens in the Solr index as follows: "allow_token_document":["Authority+Group:Ui%3A0%23.w%7Clagom%5Cadministrator", "Authority+Group:GExcel+Services+Viewers",
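The token values above are easier to read once decoded. The following is a minimal sketch, assuming the tokens are form-URL-encoded (which the "+" and "%3A"-style escapes suggest); the class name is hypothetical, and only the standard java.net.URLDecoder is used.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLDecoder;

public class AllowTokenDecodeSketch {
  // Decodes a form-URL-encoded token ("+" becomes a space, %XX becomes a character).
  public static String decode(String encodedToken) {
    try {
      return URLDecoder.decode(encodedToken, "UTF-8");
    } catch (UnsupportedEncodingException e) {
      // UTF-8 is always available on the JVM, so this should not happen.
      throw new IllegalStateException(e);
    }
  }

  public static void main(String[] args) {
    // Prints: Authority Group:Ui:0#.w|lagom\administrator
    System.out.println(decode("Authority+Group:Ui%3A0%23.w%7Clagom%5Cadministrator"));
    // Prints: Authority Group:GExcel Services Viewers
    System.out.println(decode("Authority+Group:GExcel+Services+Viewers"));
  }
}
```

The first decoded value contains a login of the form i:0#.w|domain\user, which is the claims-style form of user login.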

Re: Sharepoint get ACL

2016-12-28 Thread Karl Wright
Hi Cihad, In your case, then, the connector is calling the "Users:GetUserCollectionFromGroup" SOAP method in the SharePoint API. This method is supposed to list the users that belong to the group, but I suspect that your SharePoint instance is not set up to work in that way, and that you should

Re: Sharepoint get ACL

2016-12-28 Thread Karl Wright
Hi Cihad, The code for looking for document ACLs is as follows: >> Object node = nodeList.get( i ); String mask = doc.getValue( node, "Mask" ); long maskValue = new Long(mask).longValue(); if ((maskValue & 1L) == 1L) { // Permission to view
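The mask test in the quoted code can be tried in isolation. This is a minimal sketch with hypothetical class and method names; it only reproduces the bit check shown above (bit 0 treated as permission to view) and does not call the SharePoint API. Long.parseLong is used in place of new Long(mask).longValue(), which parses the same way.

```java
public class PermissionMaskSketch {
  // Bit 0 of the mask; the quoted code treats it as "permission to view".
  private static final long VIEW_BIT = 1L;

  // Parses the decimal mask string and checks whether the view bit is set.
  public static boolean canView(String mask) {
    long maskValue = Long.parseLong(mask);
    return (maskValue & VIEW_BIT) == VIEW_BIT;
  }

  public static void main(String[] args) {
    System.out.println(canView("1")); // true: only the view bit is set
    System.out.println(canView("2")); // false: view bit is clear
    System.out.println(canView("3")); // true: view bit plus one other bit
  }
}
```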

Re: Sharepoint get ACL

2016-12-27 Thread Karl Wright
Hi Cihad, Some questions: (1) Have you selected "SharePoint 2013" in your SharePoint connection? (2) Have you installed the ManifoldCF SharePoint 2013 plugin on your SharePoint server? You will need to do both of these in order for SharePoint 2013 ACLs to work right. Thanks, Karl On Tue, Dec

Re: [JCIFS Connector] crawl job stop on access error

2016-12-09 Thread Karl Wright
Hi Julien, There's already code in place to treat this error as a ServiceInterruption, which means that the document will be retried. However, after it is retried a certain number of times the code gives up and terminates the job. We could change this behavior to make it skip the document at that point.

Re: Empty Documentum metadata

2016-11-30 Thread Karl Wright
Type: 2 > > TYPE QR 0 0 0 > > NULL > > NULL > > NULL F > > > > I will check the DFC from my side meanwhile can you please provide us any > way to debug the application to analyze the documentum metadata and its > values fetched by manifoldcf.

RE: Empty Documentum metadata

2016-11-30 Thread Tamizh Kumaran Thamizharasan
, November 30, 2016 8:24 PM To: user@manifoldcf.apache.org Cc: Sharnel Merdeck Pereira Subject: Re: Empty Documentum metadata Hi Tamizh, I'm not sure what I am looking at here. Are you saying that the *names* of the metadata attributes are NULL? Or the *values*? If it is the values

Re: Empty Documentum metadata

2016-11-30 Thread Karl Wright
Hi Tamizh, I'm not sure what I am looking at here. Are you saying that the *names* of the metadata attributes are NULL? Or the *values*? If it is the values, then the issue is somewhere in DFC. If it is the names, I can help figure out what is going wrong. Thanks, Karl On Wed, Nov 30, 2016

RE: Empty Documentum metadata

2016-11-30 Thread Tamizh Kumaran Thamizharasan
Hi, We are trying to index documentum metadata to Solr, but in documentum-server log we can see manifoldcf is fetching empty documentum metadata as below. 12:50:40,575 DEBUG [RMI TCP Connection(22)-127.0.0.1] com.documentum.fc.client.impl.objectprotocol.ObjectProtocolV1 - deserializeType: 2

Re: Active Directory Integration

2016-11-29 Thread Karl Wright
It looks like your repository connection does not belong to the Authorization Group auth group. That's why it isn't working. Karl On Tue, Nov 29, 2016 at 9:55 AM, Furkan KAMACI wrote: > I'm testing Active Directory Integration. I index documents to Solr and I > can

Re: [MCF API] DELETE not available for some connectors

2016-11-28 Thread Karl Wright
Thanks for pointing this out. I've created CONNECTORS-1355 to address this issue, and I'll be committing the code shortly. Thanks again! Karl On Mon, Nov 28, 2016 at 9:09 AM, Julien Massiera < julien.massi...@francelabs.com> wrote: > Hi the MCF community, > > For the Datafari project, I'm

Re: Unnecessary Newline Characters and Metadata at Content

2016-11-26 Thread Shinichiro Abe
Hi, > Everything is OK when you directly send data to Solr without MCF. How did you send files? I just sent a pdf to Solr by curl, and metadata is included in the content field value. command: $ curl 'http://localhost:8983/solr/collection1/update/extract?literal.id=doc1=true=content_t' -F

Re: Unnecessary Newline Characters and Metadata at Content

2016-11-26 Thread Furkan KAMACI
Hi Shinichiro, Yes, I can see the content that way. However, besides the newline characters, there is metadata information prepended to the content. Everything is OK when you directly send data to Solr without MCF. For example, one of my content fields starts like this: *\n \n stream_size 298979 \n

Re: Unnecessary Newline Characters and Metadata at Content

2016-11-25 Thread Shinichiro Abe
Hi Furkan, Please see the previous mail[1] which may be the same issue. And as far as I know the new line chars will appear in any Tika version and you can see by json format in Solr. When you want to remove that, please use charfilter or updateprocessor in Solr. I think even when fields have new

Re: Unnecessary Newline Characters and Metadata at Content

2016-11-25 Thread Karl Wright
I am on vacation today and have other responsibilities. However, I believe Shinichiro Abe might be able to test this out. He redid the Solr integration for SolrJ 6.3. Thanks, Karl On Fri, Nov 25, 2016 at 1:54 PM, Furkan KAMACI wrote: > Hi Karl, > > Could you try to

Re: Unnecessary Newline Characters and Metadata at Content

2016-11-25 Thread Furkan KAMACI
Hi Karl, Could you try to test MCF with Solr? I cannot see the content field either with Windows Shares or File System with Solr 4.x, 5.x, 6.x. Only Solr 4.x has content, and it is as I described. The code that sends content as a stream may have a problem. Kind Regards, Furkan KAMACI On Fri,

Re: Unnecessary Newline Characters and Metadata at Content

2016-11-25 Thread Furkan KAMACI
Hi Karl, By the way, I've tried different versions of Solr and either couldn't get the content or got it as I've explained. When I check out the MCF trunk, which uses Solr 6.3.0, and use Solr 6.3.0 as the output connector, I can see documents are indexed but I cannot even see the "content" field. Kind Regards,

Re: Unnecessary Newline Characters and Metadata at Content

2016-11-25 Thread Karl Wright
Hi Furkan, The following code is used to set up a SolrJ object that is then later converted to a post request: >> private void buildExtractUpdateHandlerRequest( long length, InputStream is, String contentType, String contentName, ContentStreamUpdateRequest

Re: Unnecessary Newline Characters and Metadata at Content

2016-11-25 Thread Furkan KAMACI
Hi Karl, I used default values for Solr. In my Solr output connector, "Use the Extract Update Handler" is checked. The update handler is defined as: "/update/extract". There is no Tika content extractor defined in the job pipeline. I have Wireshark captures and logs from both ManifoldCF and Solr. I can

Re: Unnecessary Newline Characters and Metadata at Content

2016-11-24 Thread Karl Wright
Is this being indexed via the extracting update handler? What does your pipeline look like? Is the tika extractor in the pipeline? Karl On Thu, Nov 24, 2016 at 12:52 PM, Furkan KAMACI wrote: > I've indexed a file via ManifoldCF to Solr which has a content starts

Re: Invalid date format for modified_date

2016-11-20 Thread Cihad Guzel
Thanks Karl. -- Kind Regards Cihad Güzel 2016-11-20 19:51 GMT+03:00 Karl Wright : > I just committed the fix. > > Thanks! > Karl > > > On Sun, Nov 20, 2016 at 11:47 AM, Furkan KAMACI > wrote: > >> Hi Karl, >> >> I verify that modified date is not

Re: Invalid date format for modified_date

2016-11-20 Thread Karl Wright
I just committed the fix. Thanks! Karl On Sun, Nov 20, 2016 at 11:47 AM, Furkan KAMACI wrote: > Hi Karl, > > I verify that modified date is not being sent as a valid ISO date. It has > such numbers prepended to it. > > Kind Regards, > Furkan KAMACI > > On Sun, Nov 20,

Re: Invalid date format for modified_date

2016-11-20 Thread Furkan KAMACI
Hi Karl, I can confirm that the modified date is not being sent as a valid ISO date. It has numbers prepended to it. Kind Regards, Furkan KAMACI On Sun, Nov 20, 2016 at 6:32 PM, Cihad Guzel wrote: > Hi Karl, > > I have created a pull request on github. You can see the problem

Re: Invalid date format for modified_date

2016-11-20 Thread Cihad Guzel
Hi Karl, I have created a pull request on github. You can see the problem from here: https://github.com/apache/manifoldcf/pull/10/commits/6a71a44ead5507c00302cb3a0a6a96d2bd2a02ce 2016-11-20 19:25 GMT+03:00 Karl Wright : > The code for formatting a date is here: > > >> >

Re: Creating Metadata Adjuster

2016-11-20 Thread Furkan KAMACI
I found how to make it work after I've asked the question :) On Sun, Nov 20, 2016 at 6:16 PM, Karl Wright wrote: > Ok, so I'm confused about what isn't working. Can you clarify? > > Karl > > > On Sun, Nov 20, 2016 at 11:00 AM, Furkan KAMACI > wrote:

Re: Invalid date format for modified_date

2016-11-20 Thread Karl Wright
The code for formatting a date is here: >> public static String formatISO8601Date(Date dateValue) { java.text.DateFormat df = new java.text.SimpleDateFormat("-MM-dd'T'HH:mm:ss.SSS'Z'", Locale.ROOT); df.setTimeZone(TimeZone.getTimeZone("GMT")); return df.format(dateValue);
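A self-contained way to try a formatter of this shape is sketched below. The pattern as archived above starts with "-MM-dd"; the sketch assumes the intended pattern begins with a four-digit year ("yyyy"), which is an assumption and not something the snippet confirms. The class name is hypothetical.

```java
import java.text.DateFormat;
import java.text.SimpleDateFormat;
import java.util.Date;
import java.util.Locale;
import java.util.TimeZone;

public class Iso8601Sketch {
  // Formats a date as an ISO 8601 UTC timestamp with milliseconds.
  // Assumption: the year field is "yyyy"; the archived snippet omits it.
  public static String formatISO8601Date(Date dateValue) {
    DateFormat df = new SimpleDateFormat("yyyy-MM-dd'T'HH:mm:ss.SSS'Z'", Locale.ROOT);
    df.setTimeZone(TimeZone.getTimeZone("GMT"));
    return df.format(dateValue);
  }

  public static void main(String[] args) {
    // The Unix epoch prints as 1970-01-01T00:00:00.000Z
    System.out.println(formatISO8601Date(new Date(0L)));
  }
}
```

A new SimpleDateFormat is created on each call because the class is not thread-safe, which matches the structure of the quoted method.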

Re: Creating Metadata Adjuster

2016-11-20 Thread Karl Wright
Ok, so I'm confused about what isn't working. Can you clarify? Karl On Sun, Nov 20, 2016 at 11:00 AM, Furkan KAMACI wrote: > I've just tested metadata adjuster and it works. I think that all of them > should work. > > On Sun, Nov 20, 2016 at 4:59 PM, Karl Wright

Re: Creating Metadata Adjuster

2016-11-20 Thread Furkan KAMACI
I've just tested metadata adjuster and it works. I think that all of them should work. On Sun, Nov 20, 2016 at 4:59 PM, Karl Wright wrote: > Have you tried any of the other transformation connectors > programmatically? For example, the null transformation connector? > >

Re: Creating Metadata Adjuster

2016-11-20 Thread Karl Wright
Have you tried any of the other transformation connectors programmatically? For example, the null transformation connector? Karl On Sun, Nov 20, 2016 at 9:57 AM, Furkan KAMACI wrote: > I was trying to do it programmatically and it works when you don't define > it as

Re: Creating Metadata Adjuster

2016-11-20 Thread Furkan KAMACI
I was trying to do it programmatically and it works when you don't define it as "stage_description". On Sun, Nov 20, 2016 at 4:54 PM, Karl Wright wrote: > I just tried it here; worked fine. Did you click the "Add" button after > filling in the metadata expression? > >

Re: Creating Metadata Adjuster

2016-11-20 Thread Karl Wright
I just tried it here; worked fine. Did you click the "Add" button after filling in the metadata expression? Thanks, Karl On Sun, Nov 20, 2016 at 8:08 AM, Furkan KAMACI wrote: > OK, it should not be description, specification :) > > On Sun, Nov 20, 2016 at 2:59 PM,

Re: Need to index documentum metadata

2016-11-17 Thread Karl Wright
'Is there any chance where the name of documentum metadata gets changed while reaching the output connector' Hi Tamizh, If you have a Metadata Adjuster transformation connector in your pipeline, then you might be mapping the metadata name to something else. But that is the only way. The

Re: Need to index documentum metadata

2016-11-17 Thread Tamizh Kumaran Thamizharasan
Hi Furkan, Currently I am using Solr 4.3 and documentum 7.1 I couldn't see any such errors in my Solr logs. Either these fields are omitted in output connector or these fields are not fetched from documentum. Thanks, Tamizh On Nov 18, 2016 12:06 PM, "Furkan KAMACI"

Re: Need to index documentum metadata

2016-11-17 Thread Furkan KAMACI
Hi Tamizh, You could see such a changed field if you use a managed schema in Solr. Otherwise it could throw an error due to an unknown field. What versions of Solr and Documentum do you use? Kind Regards, Furkan KAMACI On Nov 18, 2016 7:43 AM, "Tamizh Kumaran Thamizharasan" <

Re: Need to index documentum metadata

2016-11-17 Thread Tamizh Kumaran Thamizharasan
Hi Karl, DFC which we are using is compatible with the documentum. Is there any chance where the name of documentum metadata gets changed while reaching the output connector? Thanks, Tamizh On Nov 17, 2016 4:52 PM, "Karl Wright" wrote: > Hi Tamizh, > > The code which

Re: Need to index documentum metadata

2016-11-17 Thread Karl Wright
Hi Tamizh, The code which fetches attribute values uses straight access to object attributes via the DFC IDfPersistentObject API. Here's the actual code. "object" is an object of type IDfPersistentObject. >> int valueCount = object.getValueCount(attribute); String[] values =

Re: Need to index documentum metadata

2016-11-17 Thread Tamizh Kumaran Thamizharasan
Hi Karl, I can see the metadata on the view page. Even after selecting the all-metadata check box, I am facing the same issue. Can you please share the steps to index documents from documentum, so that I can cross-check them? Thanks, Tamizh On Nov 17, 2016 4:20 PM, "Karl Wright"

Re: Need to index documentum metadata

2016-11-17 Thread Karl Wright
Hi Tamizh, Can you view the job and make sure when you do that the metadata fields you have selected show up on the view page? The UI for this is a bit tricky and I have seen users not use it properly before. Thanks, Karl On Thu, Nov 17, 2016 at 3:27 AM, Tamizh Kumaran Thamizharasan <

Re: Need to index documentum metadata

2016-11-16 Thread Karl Wright
Hi Tamizh, I know of a number of people who are indexing documents and metadata from documentum. I also know of many people who are indexing documents and metadata into Solr. So I know this works. But there are a lot of places you will need to configure correctly for this to work the way you

Re: Are there any way to index metadata?

2016-11-14 Thread Shinichiro Abe
Hi Cihad, It should work as follows: Parameter name=repositorytype, expression=windows. The string "windows" will be indexed to the repositorytype field. Regards, Shinichiro Abe 2016-11-14 21:47 GMT+09:00 Cihad Guzel : > Hi > > I want to see resource type (sharepoint, database,

RE: SharePoint Connector: "Root site is unreachable"

2016-11-04 Thread ashutosh.rai
of time with the ability to crawl sharepoint documents (just basic crawl). Regards -Ashutosh From: Karl Wright<mailto:daddy...@gmail.com> Sent: 04 November 2016 15:19 To: Jeff Potts<mailto:jeffpott...@gmail.com> Cc: user@manifoldcf.apache.org<mailto:user@manifoldcf.apache.o

Re: SharePoint Connector: "Root site is unreachable"

2016-11-04 Thread Karl Wright
>> At least that's the configuration that worked for us without installing >> the MCF plugin. >> >> >> >> Hope this helps. >> >> -Ashutosh >> >> >> >> >> >> *From: *Karl Wright <daddy...@gmail.com> >> *Sen

Re: SharePoint Connector: "Root site is unreachable"

2016-11-04 Thread Jeff Potts
rked for us without installing > the MCF plugin. > > > > Hope this helps. > > -Ashutosh > > > > > > *From: *Karl Wright <daddy...@gmail.com> > *Sent: *04 November 2016 00:28 > *To: *user@manifoldcf.apache.org > *Subject: *Re: SharePoint Connector: "Root

RE: SharePoint Connector: "Root site is unreachable"

2016-11-03 Thread ashutosh.rai
Wright<mailto:daddy...@gmail.com> Sent: 04 November 2016 00:28 To: user@manifoldcf.apache.org<mailto:user@manifoldcf.apache.org> Subject: Re: SharePoint Connector: "Root site is unreachable" ** This mail has been sent from an external source ** Hi Jeff, The meaning

Re: SharePoint Connector: "Root site is unreachable"

2016-11-03 Thread Karl Wright
Hi Jeff, SharePoint's site model is as follows: -- there is a root site, usually with the URL http://somedomain.com -- there are "virtual" sites, which are also full-fledged sites, usually with the URL http://somedomain.com/site/site_name -- there are subsites of the root site, usually with URLs

Re: I don't see my file content in solr index

2016-10-31 Thread Cihad Guzel
Hi Karl, I tried the output connector with Solr 4.4.0, Solr 5.5.3 and Solr 6.2.1. If I use Solr 4.4.0, everything looks OK in Simple History and in the Solr index. If I use Solr 5.5.3 or Solr 6.2.1, everything looks OK in Simple History, but I don't see my files' content in the Solr index. I could see

Re: webcrawler connector and dns lookups behind corporate http proxy

2016-10-11 Thread Markus Schuch
dcf.apache.org" <user@manifoldcf.apache.org> Subject: Re: webcrawler connector and dns lookups behind corporate http proxy If the proxy is not authenticated, I think you can just put the IP address in as the machine name and it should work. But that's all I can think of. Karl

Re: webcrawler connector and dns lookups behind corporate http proxy

2016-10-10 Thread Karl Wright
If the proxy is not authenticated, I think you can just put the IP address in as the machine name and it should work. But that's all I can think of. Karl On Mon, Oct 10, 2016 at 3:44 PM, Markus Schuch wrote: > Hi @ the lovely mcf community out there, > > in our setup we

How to re-index only certain documents?

2016-10-10 Thread Najman, Radko
Hello, we have custom transformation connectors to index from Documentum into Elasticsearch. Due to a bug in our connector some of the documents were indexed with wrong data. Is there a way to reindex only the affected documents? I tried to reset the seeding version but it didn’t help. There is 're

RE: Custom Transfo Connector - Strange behaviour

2016-10-03 Thread Marc Emery
Hi Karl, You’re right, removing the associated records has forced the complete pipeline. I will investigate this tomorrow and keep you informed. Thanks a lot marc From: Karl Wright [mailto:daddy...@gmail.com] Sent: Monday, 3 October 2016 16:57 To: user@manifoldcf.apache.org Subject: Re

Re: ApacheCon Europe 2016 Seville

2016-08-31 Thread Antonio David Pérez Morales
Yes, I was thinking the same. So preparing a proposal for a presentation will be better in order to attract people who are not aware of ManifoldCF to our community. I will try to prepare the proposal during the weekend. I'll keep you posted. Regards

Re: ApacheCon Europe 2016 Seville

2016-08-31 Thread Karl Wright
As you probably know, a tutorial will require more work and original materials and planning on your part, and will also require some understanding of the goals of your likely audience. A presentation is the most straightforward and is usually meant for people who are not familiar with the project

Re: ApacheCon Europe 2016 Seville

2016-08-31 Thread Antonio David Pérez Morales
Perfect I will try to prepare the proposal during the weekend because the deadline is 9 September. The proposal has to contain the type (presentation, panel discussion, tutorial), the category (dev, business, etc), (biography), abstract about what will be presented, the audience and what they

Re: ApacheCon Europe 2016 Seville

2016-08-31 Thread Piergiorgio Lucidi
Hi Antonio, this is great news! I'll try to find out whether I can join the event :) I think that you can take a look at some presentations documented on the website [1]. But you can also see other presentations in my SlideShare account [2] and [3]. Hope this helps. Cheers, Piergiorgio [1] -

Re: Has anyone successfully populated Elastic Search properties from custom SharePoint Meta data fields?

2016-08-31 Thread Karl Wright
Aug 30, 2016 at 3:27 PM, Holtz, Michael (IT Consultant) < >> mho...@consultantemail.com> wrote: >> >>> Ah got it. I was wondering if it was something like that. >>> >>> >>> >>> Thanks, >>> >>> Mike H. >>
