[jira] [Commented] (CASSANDRA-16120) Add ability for jvm-dtest to grep instance logs

Alex Petrov (Jira) Tue, 15 Sep 2020 10:45:14 -0700


    [ 
https://issues.apache.org/jira/browse/CASSANDRA-16120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17196434#comment-17196434
 ]


Alex Petrov commented on CASSANDRA-16120:
-----------------------------------------

+1 on both patches with minor comments on github.

The only thing I'm thinking about is how we could implement something like 
streaming results. For example, if we had a Harry workload running for several 
hours, and we'd like to interrupt it if we see some exception anywhere in the 
logs. Probably we can have a poller that would search ahead, but at some point 
we can implement it with some in-memory streaming.

> Add ability for jvm-dtest to grep instance logs
> -----------------------------------------------
>
>                 Key: CASSANDRA-16120
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-16120
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Test/dtest/java
>            Reporter: David Capwell
>            Assignee: David Capwell
>            Priority: Normal
>              Labels: pull-request-available
>             Fix For: 4.0-beta
>
>          Time Spent: 3h 40m
>  Remaining Estimate: 0h
>
> One of the main gaps between python dtest and jvm dtest is python dtest 
> supports the ability to grep the logs of an instance; we need this capability 
> as some tests require validating logs were triggered.
> Pydocs for common log methods 
> {code}
> |  grep_log(self, expr, filename='system.log', from_mark=None)
> |      Returns a list of lines matching the regular expression in parameter
> |      in the Cassandra log of this node
> |
> |  grep_log_for_errors(self, filename='system.log')
> |      Returns a list of errors with stack traces
> |      in the Cassandra log of this node
> |
> |  grep_log_for_errors_from(self, filename='system.log', seek_start=0)
> {code}
> {code}
> |  watch_log_for(self, exprs, from_mark=None, timeout=600, process=None, 
> verbose=False, filename='system.log')
> |      Watch the log until one or more (regular) expression are found.
> |      This methods when all the expressions have been found or the method
> |      timeouts (a TimeoutError is then raised). On successful completion,
> |      a list of pair (line matched, match object) is returned.
> {code}
> Below is a POC showing a way to do such logic
> {code}
> package org.apache.cassandra.distributed.test;
> import java.io.BufferedReader;
> import java.io.FileInputStream;
> import java.io.IOException;
> import java.io.InputStreamReader;
> import java.io.UncheckedIOException;
> import java.nio.charset.StandardCharsets;
> import java.util.Iterator;
> import java.util.Spliterator;
> import java.util.Spliterators;
> import java.util.regex.Matcher;
> import java.util.regex.Pattern;
> import java.util.stream.Stream;
> import java.util.stream.StreamSupport;
> import com.google.common.io.Closeables;
> import org.junit.Test;
> import org.apache.cassandra.distributed.Cluster;
> import org.apache.cassandra.utils.AbstractIterator;
> public class AllTheLogs extends TestBaseImpl
> {
>    @Test
>    public void test() throws IOException
>    {
>        try (final Cluster cluster = init(Cluster.build(1).start()))
>        {
>            String tag = System.getProperty("cassandra.testtag", 
> "cassandra.testtag_IS_UNDEFINED");
>            String suite = System.getProperty("suitename", 
> "suitename_IS_UNDEFINED");
>            String log = String.format("build/test/logs/%s/TEST-%s.log", tag, 
> suite);
>            grep(log, "Enqueuing flush of tables").forEach(l -> 
> System.out.println("I found the thing: " + l));
>        }
>    }
>    private static Stream<String> grep(String file, String regex) throws 
> IOException
>    {
>        return grep(file, Pattern.compile(regex));
>    }
>    private static Stream<String> grep(String file, Pattern regex) throws 
> IOException
>    {
>        BufferedReader reader = new BufferedReader(new InputStreamReader(new 
> FileInputStream(file), StandardCharsets.UTF_8));
>        Iterator<String> it = new AbstractIterator<String>()
>        {
>            protected String computeNext()
>            {
>                try
>                {
>                    String s;
>                    while ((s = reader.readLine()) != null)
>                    {
>                        Matcher m = regex.matcher(s);
>                        if (m.find())
>                            return s;
>                    }
>                    reader.close();
>                    return endOfData();
>                }
>                catch (IOException e)
>                {
>                    Closeables.closeQuietly(reader);
>                    throw new UncheckedIOException(e);
>                }
>            }
>        };
>        return StreamSupport.stream(Spliterators.spliteratorUnknownSize(it, 
> Spliterator.ORDERED), false);
>    }
> }
> {code}
> And
> {code}
> @Test
>    public void test() throws IOException
>    {
>        try (final Cluster cluster = init(Cluster.build(1).start()))
>        {
>            String tag = System.getProperty("cassandra.testtag", 
> "cassandra.testtag_IS_UNDEFINED");
>            String suite = System.getProperty("suitename", 
> "suitename_IS_UNDEFINED");
>            //TODO missing way to get node id
> //            cluster.get(1);
>            String log = 
> String.format("build/test/logs/%s/TEST-%s-node%d.log", tag, suite, 1);
>            grep(log, "Enqueuing flush of tables").forEach(l -> 
> System.out.println("I found the thing: " + l));
>        }
>    }
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org

[jira] [Commented] (CASSANDRA-16120) Add ability for jvm-dtest to grep instance logs

Reply via email to