Re: Proposal: doxygen-style comments for C++ client API (and may be other C++ components?)

Alexey Serbin Tue, 28 Jun 2016 11:00:12 -0700

Hi,

I re-formatted documentation in client.h file to be in doxygen-style.
Please take a look at it to get an idea on the suggested style, etc.:
  https://gist.github.com/alexeyserbin/faad98b2ec9959acdf7256ff7d0bf139
I also attached the file as is.


Sure, this is just an initial draft -- let's communicate on different
aspects and converge on the style conventions.

Please let me know if you see there is something worth changing.  Your
feedback is highly appreciated!


Thanks,

Alexey

On Thu, Jun 9, 2016 at 12:01 PM, Adar Dembo <a...@cloudera.com> wrote:

> We don't enforce much semantic consistency on our comments today. We
> follow the Google style guide, and cpplint.py (tied to our 'make lint'
> target) can enforce some stuff, but I don't think it does much, though
> I'm not sure; I haven't really played around with it. So yeah, if we
> enforce anything it's during code review, and that turns out to be
> tedious and time consuming for everyone involved. Take a look at
> cpplint.py; maybe it's got some checks that we're not running that'd
> yield better consistency.
>
>
>
>
> On Thu, Jun 9, 2016 at 11:51 AM, Alexey Serbin <aser...@cloudera.com>
> wrote:
> > Adar,
> >
> > I concur -- noisy and useless comments is an issue if not taken care of.
> >
> > BTW, what about current comments in the code -- how is semantic
> consistency
> > being enforced?  I would think it's more about code-review time; would
> not
> > expect much automation there.  As for the style, I also think
> style-checker
> > would be great to have.  Let's see what I can find out there.  I remember
> > there were some tools (vera++, etc.).  From that regard I really like Go
> > lang approach on this :)
> >
> > Let me provide a snippet/sample of how that doxygen-styled comments would
> > look like with current code and we can start from that.
> >
> >
> > Thanks,
> >
> > Alexey
> >
> >
> >
> > On Thu, Jun 9, 2016 at 10:21 AM, Adar Dembo <a...@cloudera.com> wrote:
> >
> >> I agree with Mike. In my experience Doxygen-style documentation can be
> >> very noisy in that it forces developers to write reams upon reams of
> >> "obvious" documentation (i.e. adding a line for a @return that is
> >> completely self-explanatory).
> >>
> >> I'm open to the idea of using it for public headers provided:
> >> - It's easy to draw the line between those files and the rest of the
> >> codebase.
> >> - We have some script to automate checking the Doxygen style on those
> >> files.
> >>
> >> I'd also like us spend some time discussing which aspects of Doxygen
> >> are worth implementing and which are excessive. Alexey, could you
> >> drive this discussion? Not sure if it makes more sense to do it over a
> >> sample patch (to e.g. client.h) or in a vacuum, whatever you think is
> >> best.
> >>
> >> On Thu, Jun 9, 2016 at 9:57 AM, Alexey Serbin <aser...@cloudera.com>
> >> wrote:
> >> > Hi All,
> >> >
> >> > Thank you for sharing your thoughts on this.
> >> >
> >> > From what I see the consensus is that for client API C++ headers it
> makes
> >> > sense to re-format in-code documentation to be in doxygen style.  So,
> I'm
> >> > thinking about discussing style-related questions with Mike Percy and
> >> > others, preparing a patch and sending it for review.
> >> >
> >> > I think it's worth publishing the auto-generated results (HTML) along
> >> with
> >> > client Java API docs.  Misty, what help is needed there?
> >> >
> >> >
> >> > Thanks,
> >> >
> >> > Alexey
> >> >
> >> > On Thu, Jun 9, 2016 at 9:37 AM, Misty Stanley-Jones <
> >> > mstanleyjo...@cloudera.com> wrote:
> >> >
> >> >> I'm +1 too. Do we want to publish these as HTML like we do with
> Javadoc
> >> >> too? If so, who wants to volunteer to help me figure that out?
> >> >>
> >> >> On Thu, Jun 9, 2016 at 8:28 AM, Sameer Abhyankar <
> >> sabhyan...@cloudera.com>
> >> >> wrote:
> >> >>
> >> >> > +1 for the client facing APIs!
> >> >> >
> >> >> > Sameer Abhyankar
> >> >> > Cloudera, Inc.
> >> >> > (404) 431-7806
> >> >> > sam...@cloudera.com
> >> >> >
> >> >> >
> >> >> >
> >> >> > On Wed, Jun 8, 2016 at 10:25 PM, Mike Percy <mpe...@apache.org>
> >> wrote:
> >> >> >
> >> >> > > Hey Alexey,
> >> >> > > Glad you brought it up. I'm inclined to agree that this would be
> >> >> helpful
> >> >> > > for users of the C++ client APIs.
> >> >> > >
> >> >> > > But... I'm hesitant to do it for the rest of the code base. It
> can
> >> be
> >> >> > > pretty noisy, requires ongoing maintenance, and honestly I don't
> >> think
> >> >> it
> >> >> > > is helpful if you have decent dev tools set up (good editor with
> >> good
> >> >> > > plugins, ack / ag, etc)
> >> >> > >
> >> >> > > If we adopt Doxygen, I think we should agree on a particular
> style
> >> and
> >> >> > > adhere to it.
> >> >> > >
> >> >> > > Mike
> >> >> > >
> >> >> > >
> >> >> > > On Wed, Jun 8, 2016 at 6:00 PM, Dan Burkert <d...@cloudera.com>
> >> wrote:
> >> >> > >
> >> >> > > > +1 from me for at least doing it in client.h and the few other
> >> public
> >> >> > > > header files.  These files have pretty much settled down, so it
> >> >> > shouldn't
> >> >> > > > be too onerous to keep the doxygen comments up to date once
> they
> >> are
> >> >> > > > created.
> >> >> > > >
> >> >> > > >  - Dan
> >> >> > > >
> >> >> > > > On Wed, Jun 8, 2016 at 4:36 PM, Alexey Serbin <
> >> aser...@cloudera.com>
> >> >> > > > wrote:
> >> >> > > >
> >> >> > > > > Hi All,
> >> >> > > > >
> >> >> > > > > Looking at current documentation for Kudu client API at
> >> >> getkudu.org
> >> >> > > > > you can see that Java API has auto-generated documentation
> >> >> > > > > but C++ API does not.
> >> >> > > > >
> >> >> > > > > Adding doxygen-style comments into C++ header files would
> >> >> > > > > allow to auto-generate API documentation for C++ client API
> as
> >> >> well.
> >> >> > > > >
> >> >> > > > > Assuming doxygen-formatted comments are not considered
> >> >> > > > > as a violation of current C++ coding style,
> >> >> > > > > would it make sense to introduce doxygen-style comments
> >> >> > > > > for C++ client API headers?
> >> >> > > > >
> >> >> > > > > If it's worth it, in-line documentation in other parts of the
> >> C++
> >> >> > code
> >> >> > > > base
> >> >> > > > > could be re-formatted into doxygen style as well.
> >> >> > > > >
> >> >> > > > > What do you think?
> >> >> > > > >
> >> >> > > > > Thank you!
> >> >> > > > >
> >> >> > > > >
> >> >> > > > > Best regards,
> >> >> > > > >
> >> >> > > > > Alexey
> >> >> > > > >
> >> >> > > >
> >> >> > >
> >> >> >
> >> >>
> >>
>

// Licensed to the Apache Software Foundation (ASF) under one
// or more contributor license agreements.  See the NOTICE file
// distributed with this work for additional information
// regarding copyright ownership.  The ASF licenses this file
// to you under the Apache License, Version 2.0 (the
// "License"); you may not use this file except in compliance
// with the License.  You may obtain a copy of the License at
//
//   http://www.apache.org/licenses/LICENSE-2.0
//
// Unless required by applicable law or agreed to in writing,
// software distributed under the License is distributed on an
// "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
// KIND, either express or implied.  See the License for the
// specific language governing permissions and limitations
// under the License.

#ifndef KUDU_CLIENT_CLIENT_H
#define KUDU_CLIENT_CLIENT_H

#include <cstdint>
#include <string>
#include <vector>

#include "kudu/client/resource_metrics.h"
#include "kudu/client/row_result.h"
#include "kudu/client/scan_batch.h"
#include "kudu/client/scan_predicate.h"
#include "kudu/client/schema.h"
#include "kudu/client/shared_ptr.h"
#ifdef KUDU_HEADERS_NO_STUBS
#include <gtest/gtest_prod.h>
#include "kudu/gutil/macros.h"
#include "kudu/gutil/port.h"
#else
#include "kudu/client/stubs.h"
#endif
#include "kudu/client/write_op.h"
#include "kudu/util/kudu_export.h"
#include "kudu/util/monotime.h"
#include "kudu/util/status.h"

#if _GLIBCXX_USE_CXX11_ABI
#error \
  "Kudu will not function properly if built with gcc 5's new ABI. " \
  "Please modify your application to set -D_GLIBCXX_USE_CXX11_ABI=0. " \
  "For more information about the new ABI, see " \
  "https://gcc.gnu.org/onlinedocs/libstdc++/manual/using_dual_abi.html.";
#endif

namespace kudu {

class ClientStressTest_TestUniqueClientIds_Test;
class LinkedListTester;
class PartitionSchema;

namespace client {

class KuduLoggingCallback;
class KuduScanToken;
class KuduSession;
class KuduStatusCallback;
class KuduTable;
class KuduTableAlterer;
class KuduTableCreator;
class KuduTabletServer;
class KuduValue;
class KuduWriteOperation;

namespace internal {
class Batcher;
class GetTableSchemaRpc;
class LookupRpc;
class MetaCache;
class RemoteTablet;
class RemoteTabletServer;
class WriteRpc;
} // namespace internal

/// Install a callback for internal client logging.
///
/// It can be invoked for a log event of any severity, across any KuduClient
/// instance.  Only the first invocation has effect; subsequent invocations are
/// a no-op.  Before a callback is registered, all internal client log events are
/// logged to the stderr.
///
/// @param [in] cb
///   Logging callback.  The caller must ensure that @c cb stays alive until
///   @see UninstallLoggingCallback() is called.
void KUDU_EXPORT InstallLoggingCallback(KuduLoggingCallback* cb);

/// Remove callback installed via @see InstallLoggingCallback().
///
/// Only the first invocation has any effect; subsequent invocations are
/// a no-op.
///
/// Should be called before unloading the client library.
void KUDU_EXPORT UninstallLoggingCallback();

/// Set the logging verbosity of the client library.
///
/// By default, the logging level is 0.  Logs become progressively more verbose
/// as the level is increased. Empirically, the highest verbosity level
/// used in Kudu is 6, which includes very fine-grained tracing information.
/// Most useful logging is enabled at level 1 or 2, with the higher levels
/// used only in rare circumstances.
///
/// Logs are emitted to stderr, or to the configured log callback
/// at @c SEVERITY_INFO.
///
/// This function may be called safely at any point during usage of the library.
///
/// @param [in] level
///   Logging level to set
void KUDU_EXPORT SetVerboseLogLevel(int level);

/// Set signal number for internal use
///
/// The Kudu client library uses signals internally in some cases.
/// By default, it uses SIGUSR2.  If your application makes use of SIGUSR2,
/// this advanced API can help workaround conflicts.
///
/// @param [in] signum
///   Signal number to use for internal 
/// @return Operation result status
Status KUDU_EXPORT SetInternalSignalNumber(int signum);

/// Get short version info
///
/// @return Single-line version string identifying the Kudu client
std::string KUDU_EXPORT GetShortVersionString();

/// Get detailed version-info string
///
/// @return Multi-line version string identifying the client,
///   including build time, etc.
std::string KUDU_EXPORT GetAllVersionInfo();

/// A "factory" for the @see KuduClient objects
///
/// This class is used to create instances of KuduClient class
/// with pre-set options/parameters.
///
class KUDU_EXPORT KuduClientBuilder {
 public:
  KuduClientBuilder();
  ~KuduClientBuilder();

  KuduClientBuilder& clear_master_server_addrs();

  /// Add RPC addresses of multiple masters.
  ///
  /// @param [in] addrs
  ///   RPC addresses of masterts to add
  /// @return Reference to the updated instance
  KuduClientBuilder& master_server_addrs(const std::vector<std::string>& addrs);

  /// Add an RPC address of a master
  ///
  /// At least one master is required.
  ///
  /// @param [in] addrs
  ///   RPC address of master server to add
  /// @return Reference to the updated instance
  KuduClientBuilder& add_master_server_addr(const std::string& addr);

  /// Set default timeout for administrative operations
  ///
  /// Using this method it's possible to modify the default timeout
  /// for operations like CreateTable, AlterTable, etc.
  /// By default it is 30 seconds.
  ///
  /// @param [in] timeout
  ///   Timeout value to set
  /// @return Reference to the updated instance
  KuduClientBuilder& default_admin_operation_timeout(const MonoDelta& timeout);

  /// Set the default timeout for individual RPCs. Optional.
  ///
  /// If not provided, defaults to 10 seconds.
  ///
  /// @param [in] timeout
  ///   Timeout value to set
  /// @return Reference to the updated instance
  KuduClientBuilder& default_rpc_timeout(const MonoDelta& timeout);

  /// Create client object.
  /// Note that KuduClients are shared amongst multiple threads and, as such,
  /// are stored in shared pointers.
  ///
  /// @param [out]
  ///   The newly created instance wrapped into a shared pointer.
  /// @return Operation status.  The return value may indicate
  ///   an error in the create operation, or a misuse of the builder;
  ///   in the latter case, only the last error is
  Status Build(sp::shared_ptr<KuduClient>* client);

 private:
  class KUDU_NO_EXPORT Data;

  // Owned.
  Data* data_;

  DISALLOW_COPY_AND_ASSIGN(KuduClientBuilder);
}; // class KuduClientBuilder

/// KuduClient -- a handle for a connection to a cluster.
///
/// The KuduClient class represents a connection to a cluster.  From the user
/// perspective, they should only need to create one of these in their
/// application, likely a singleton -- but it's not a singleton in Kudu in any
/// way.  Different KuduClient objects do not interact with each other -- no
/// connection pooling, etc.  Each KuduClient instance is sandboxed with no
/// global cross-client state.
///
/// In the implementation, the client holds various pieces of common
/// infrastructure which is not table-specific:
///   @li RPC messenger: reactor threads and RPC connections are pooled here
///   @li Authentication: the client is initialized with some credentials,
///     and all accesses through it share those credentials.
///   @li Caches: caches of table schemas, tablet locations, tablet server IP
///     addresses, etc are shared per-client.
///
/// In order to actually access data on the cluster, callers must first
/// create a KuduSession object using NewSession().  A KuduClient may
/// have several associated sessions.
///
/// @note This class is thread-safe.
///
/// @todo Cluster administration functions are likely to be in this class
///   as well.
class KUDU_EXPORT KuduClient : public sp::enable_shared_from_this<KuduClient> {
 public:
  ~KuduClient();

  /// Create a KuduTableCreator instance
  ///
  /// @return Pointer to newly created instance: it's the caller's
  ///   responsibility to free it.
  KuduTableCreator* NewTableCreator();

  /// Check whether CreateTable operation is in-progress
  ///
  /// @param [in] table_name
  ///   Name of the table in question
  /// @param [out] create_in_progress
  ///   The value is set only in case of success; it is @true iff
  ///   the operation is in progress.
  /// @return Operation status
  Status IsCreateTableInProgress(const std::string& table_name,
                                 bool *create_in_progress);

  /// Delete/drop a table
  ///
  /// @param [in] table_name
  ///   Name of the table to drop
  /// @return Operation status
  Status DeleteTable(const std::string& table_name);

  /// Creates a KuduTableAlterer
  ///
  /// @param [in] table_name
  ///   Name of the table to watch for
  /// @return Pointer to newly created instance: it's the caller's
  ///   responsibility to free it.
  KuduTableAlterer* NewTableAlterer(const std::string& table_name);

  /// Check if table alteration is in process
  ///
  /// @param [in] table_name
  ///   Name of the table in question
  /// @param [out] alter_in_progress
  ///   Set to @c true if an AlterTable operation is in-progress;
  ///   set to @c false otherwise
  /// @return Operation status
  Status IsAlterTableInProgress(const std::string& table_name,
                                bool *alter_in_progress);
  /// Get scheme for a table
  ///
  /// @param [in] table_name
  ///   Name of the table if question
  /// @param [out] schema
  ///   Raw pointer to the schema object: caller gets ownership
  /// @return Operation status
  Status GetTableSchema(const std::string& table_name,
                        KuduSchema* schema);

  Status ListTabletServers(std::vector<KuduTabletServer*>* tablet_servers);

  /// List only those tables whose names pass a substring match on 'filter'.
  ///
  /// @param [out] tables
  ///   Result tables 'tables' is appended to only on success.
  /// @param [in] filter
  ///   Substring-filter to use; empty sub-string filter matches any tables
  /// @return Status object for the operation
  Status ListTables(std::vector<std::string>* tables,
                    const std::string& filter = "");

  /// Check if the table given by 'table_name' exists.
  ///
  /// @param [in] table_name
  ///   Name of the table in question
  /// @param [out] exists
  ///   Set only on success: set to @true iff table exists
  /// @return Status object for the operation
  Status TableExists(const std::string& table_name, bool* exists);

  /// Open table with the given name
  ///
  /// If the table has not been opened before, then open the table
  /// with the given name.  If the table has not been opened before
  /// for this client, this will do an RPC to ensure that the table exists
  /// and look up its schema.
  ///
  /// @param [in] table_name
  ///   Name of the table in question
  /// @param [out] table
  ///   The result table
  /// @return Operation status
  ///
  /// @todo Should we offer an async version of this as well?
  /// @todo Probably should have a configurable timeout in KuduClientBuilder?
  Status OpenTable(const std::string& table_name,
                   sp::shared_ptr<KuduTable>* table);

  /// Create a new session for interacting with the cluster.
  ///
  /// This is a fully local operation (no RPCs or blocking).
  ///
  /// @return A new session object: caller is responsible for destroying it.
  sp::shared_ptr<KuduSession> NewSession();

  /// Policy with which to choose amongst multiple replicas.
  enum ReplicaSelection {
    LEADER_ONLY,      ///< Select the LEADER replica.

    CLOSEST_REPLICA,  ///< Select the closest replica to the client,
                      ///< or a random one if all replicas are equidistant.

    FIRST_REPLICA     ///< Select the first replica in the list.
  };

  /// @return @c true iff it's a multi-master session
  bool IsMultiMaster() const;

  const MonoDelta& default_admin_operation_timeout() const;
  const MonoDelta& default_rpc_timeout() const;

  /// Value for the latest observed timestamp when none has been observed
  /// or set.
  static const uint64_t kNoTimestamp;

  /// Get highest HybridTime timestamp observed by the client.
  ///
  /// The latest observed timestamp can be used to start a snapshot scan on a
  /// table which is guaranteed to contain all data written or previously read
  /// by this client.  Check @see KuduScanner for more details on timestamps.
  ///
  /// @return Highest HybridTime timestamp observed by the client
  uint64_t GetLatestObservedTimestamp() const;

  /// Sets the latest observed HybridTime timestamp
  ///
  /// This is only useful when forwarding timestamps between clients
  /// to enforce external consistency when using KuduSession::CLIENT_PROPAGATED
  /// external consistency mode.  To use this the user must obtain
  /// the HybridTime encoded timestamp from the first client with
  /// @see KuduClient::GetLatestObservedTimestamp() and then set it
  /// in the new client with this method.
  ///
  /// @param [in] ht_timestamp
  ///   Timestamp encoded in HybridTime format
  void SetLatestObservedTimestamp(uint64_t ht_timestamp);

 private:
  class KUDU_NO_EXPORT Data;

  friend class internal::Batcher;
  friend class internal::GetTableSchemaRpc;
  friend class internal::LookupRpc;
  friend class internal::MetaCache;
  friend class internal::RemoteTablet;
  friend class internal::RemoteTabletServer;
  friend class internal::WriteRpc;
  friend class KuduClientBuilder;
  friend class KuduScanner;
  friend class KuduScanTokenBuilder;
  friend class KuduTable;
  friend class KuduTableAlterer;
  friend class KuduTableCreator;

  FRIEND_TEST(kudu::ClientStressTest, TestUniqueClientIds);
  FRIEND_TEST(ClientTest, TestGetTabletServerBlacklist);
  FRIEND_TEST(ClientTest, TestMasterDown);
  FRIEND_TEST(ClientTest, TestMasterLookupPermits);
  FRIEND_TEST(ClientTest, TestNonCoveringRangePartitions);
  FRIEND_TEST(ClientTest, TestReplicatedMultiTabletTableFailover);
  FRIEND_TEST(ClientTest, TestReplicatedTabletWritesWithLeaderElection);
  FRIEND_TEST(ClientTest, TestScanFaultTolerance);
  FRIEND_TEST(ClientTest, TestScanTimeout);
  FRIEND_TEST(ClientTest, TestWriteWithDeadMaster);
  FRIEND_TEST(MasterFailoverTest, TestPauseAfterCreateTableIssued);

  KuduClient();

  // Owned.
  Data* data_;

  DISALLOW_COPY_AND_ASSIGN(KuduClient);
};

/// A helper class to create a new table with the desired options.
class KUDU_EXPORT KuduTableCreator {
 public:
  ~KuduTableCreator();

  /// Set name for the result table.
  ///
  /// This method must be called at an object.
  ///
  /// @todo Should it be a parameter for the constructor instead?
  ///
  /// @param [in] name
  ///   Name of the target table -- it is copied.
  /// @return Reference to the modified object
  KuduTableCreator& table_name(const std::string& name);

  /// Set the schema with which to create next table.
  ///
  /// @param [in] schema
  ///   Schema to use.  Must remain valid for the lifetime of the builder.
  ///   Must be non-NULL.
  /// @return Reference to the modified object
  KuduTableCreator& schema(const KuduSchema* schema);

  /// Add a set of hash partitions to the table.
  ///
  /// Tables must be created with either range, hash, or range and hash
  /// partitioning.
  ///
  /// For each set of hash partitions added to the table, the total number of
  /// table partitions is multiplied by the number of buckets.  For example,
  /// if a table is created with 3 split rows, and two hash partitions
  /// with 4 and 5 buckets respectively, the total number of table partitions
  /// will be 80 (4 range partitions * 4 hash buckets * 5 hash buckets).
  ///
  /// @param [in] columns
  ///   Names of columns to build partition
  /// @param [in] num_backets
  ///   Number of buckets for the hashing
  /// @return Reference to the modified object
  KuduTableCreator& add_hash_partitions(const std::vector<std::string>& columns,
                                        int32_t num_buckets);

  /// Add a set of hash partitions to the table (with seed)
  ///
  /// This method takes a seed value, which can be used to randomize the
  /// mapping of rows to hash buckets.  Setting the seed may provide some
  /// amount of protection against denial of service attacks when the hashed
  /// columns contain user provided values.
  ///
  /// @param [in] columns
  ///   Names of columns to build partition
  /// @param [in] num_backets
  ///   Number of buckets for the hashing
  /// @param [in] seed
  ///   Hash: seed for mapping rows to hash buckets
  /// @return Reference to the modified object
  KuduTableCreator& add_hash_partitions(const std::vector<std::string>& columns,
                                        int32_t num_buckets, int32_t seed);

  /// Set the columns on which the table will be range-partitioned.
  ///
  /// Tables must be created with either range, hash, or range and hash
  /// partitioning. To force the use of a single tablet (not recommended), call
  /// this method with an empty vector and set no split rows an no hash
  /// partitions.
  ///
  /// @param [in] columns
  ///   Name of columns to use for partitioning.  Every column must be
  ///   a part of the table's primary key.  If not set, or if called with
  ///   an empty vector, the table will be created without range partitioning.
  /// @return Reference to the modified object
  KuduTableCreator& set_range_partition_columns(const std::vector<std::string>& columns);

  /// Add a range partition split at the provided row.
  ///
  /// If no range split rows are added, no range pre-splitting is performed.
  ///
  /// @param [in] split_row
  ///   The row to use for partitioning.  If the row is missing a value
  ///   for any of the range partition columns, the logical minimum value
  ///   for that column type will be used by default.
  ///   The KuduObjectCreator object takes ownership of the parameter.
  /// @return Reference to the modified object
  KuduTableCreator& add_range_split(KuduPartialRow* split_row);

  /// @deprecated Use add_range_split() instead
  KuduTableCreator& split_rows(const std::vector<const KuduPartialRow*>& split_rows);

  /// Add a partition range.
  ///
  /// Add a partition range bound to the table with an inclusive lower bound
  /// and exclusive upper bound.
  ///
  /// The table creator takes ownership of the rows.  If either row is empty,
  /// then that end of the range will be unbounded.  If a range column is
  /// missing a value, the logical minimum value for that column type will
  /// be used as the default.
  ///
  /// Multiple range bounds may be added, but they must not overlap. All split
  /// rows must fall in one of the range bounds. The lower bound must be less
  /// than the upper bound.
  ///
  /// If this method is not called, the table's range is be unbounded.
  ///
  /// @return Reference to the modified object
  KuduTableCreator& add_range_bound(KuduPartialRow* lower_bound,
                                    KuduPartialRow* upper_bound);

  /// Set the number of replicas for each tablet in the table.
  ///
  /// @parameter [in] n_replicas
  ///   Number of replicas to set.  This should be an odd number.
  ///   If not provided (or if <= 0), falls back to the server-side default.
  /// @return Reference to the modified object
  KuduTableCreator& num_replicas(int n_replicas);

  /// Set the timeout for the operation.
  ///
  /// This includes any waiting after the create has been submitted
  /// (i.e if the create is slow to be performed for a large table,
  ///  it may time out and then later be successful).
  /// @param [in] timeout
  ///   Timeout to set.
  /// @return Reference to the modified object
  KuduTableCreator& timeout(const MonoDelta& timeout);

  /// Wait for the table to be fully created before returning.
  ///
  /// If not called, defaults to @c true.
  /// @parameter [in] wait
  ///   Whether to wait for completion of operations.
  /// @return Reference to the modified object
  KuduTableCreator& wait(bool wait);

  /// Create the table.
  ///
  /// @return Operation result status.  The return value may indicate an error
  ///   in the create table operation, or a misuse of the builder.
  ///   In the latter case, only the last error is returned.
  Status Create();

 private:
  class KUDU_NO_EXPORT Data;

  friend class KuduClient;

  explicit KuduTableCreator(KuduClient* client);

  // Owned.
  Data* data_;

  DISALLOW_COPY_AND_ASSIGN(KuduTableCreator);
};

/// A KuduTable represents a table on a particular cluster.
///
/// A KuduTable holds the current schema of the table.  Any given KuduTable
/// instance belongs to a specific KuduClient instance.
///
/// Upon construction, the table is looked up in the catalog (or catalog cache),
/// and the schema fetched for introspection.
///
/// @note This class is thread-safe.
class KUDU_EXPORT KuduTable : public sp::enable_shared_from_this<KuduTable> {
 public:
  ~KuduTable();

  /// @return Name of the table
  const std::string& name() const;

  /// Get the table's ID.
  ///
  /// This is an internal identifier which uniquely identifies a table.
  /// If the table is deleted and recreated with the same
  /// name, the ID will distinguish the old table from the new.
  ///
  /// @return Identifier string
  const std::string& id() const;

  /// Get scheme for the table
  ///
  /// @return Reference to the scheme object
  const KuduSchema& schema() const;

  // Create a new write operation for this table. It is the caller's
  // responsibility to free it, unless it is passed to KuduSession::Apply().
  //
  // The provided operations are:
  //   INSERT: Adds a new row. Fails if the row already exists.
  //   UPSERT: Adds a new row. If there's an existing row, updates it.
  //   UPDATE: Updates an existing row. Fails if the row does not exist.
  //   DELETE: Deletes an existing row. Fails if the row does not exist.
  KuduInsert* NewInsert();
  KuduUpsert* NewUpsert();
  KuduUpdate* NewUpdate();
  KuduDelete* NewDelete();

  /// Create a new comparision predicate
  ///
  /// This method creates a new instance of comparison predicate which
  /// can be used for scanners on this table object.
  ///
  /// @param [in] col_name
  ///   Name of column to use for comparision
  /// @param [in] op
  ///   Comparision operation to use
  /// @param [in] value
  ///   The type of value must correspond to the type of the column to which
  ///   the predicate is to be applied.  For example, if the given column is
  ///   any type of integer, the KuduValue should also be an integer, with its
  ///   value in the valid range for the column type.  No attempt is made
  ///   to cast between floating point and integer values,
  //    or numeric and string values.
  /// @return Raw pointer to instance of comparision predicate.  The caller owns
  ///   the result until it is passed into KuduScanner::AddConjunctPredicate().
  ///   The returned predicate object takes ownership over the @c value
  ///   Non-NULL is returned both in success and error cases.
  ///   In the case of an error (e.g. invalid column name), a non-NULL value
  ///   is still returned.  The error will be returned when attempting
  ///   to add this predicate to a KuduScanner.
  KuduPredicate* NewComparisonPredicate(const Slice& col_name,
                                        KuduPredicate::ComparisonOp op,
                                        KuduValue* value);

  /// Get the KuduClient object associated with the table
  KuduClient* client() const;

  /// Get partition scheme for the table
  const PartitionSchema& partition_schema() const;

 private:
  class KUDU_NO_EXPORT Data;

  friend class KuduClient;

  KuduTable(const sp::shared_ptr<KuduClient>& client,
            const std::string& name,
            const std::string& table_id,
            const KuduSchema& schema,
            const PartitionSchema& partition_schema);

  // Owned.
  Data* data_;

  DISALLOW_COPY_AND_ASSIGN(KuduTable);
}; // class KuduTable

/// @brief Alters an existing table based on the provided steps
///
/// Create a new instance of alterer using @see KuduClient::NewTableAlterer().
/// An example of usage:
/// @code
/// KuduTableAlterer* alterer = client->NewTableAlterer("table-name");
/// alterer->AddColumn("foo")->Type(KuduColumnSchema::INT32)->NotNull();
/// alterer->AlterColumn("bar")->Compression(KuduColumnStorageAttributes::LZ4);
/// Status s = alterer->Alter();
/// delete alterer;
/// @endcode
///
/// @todo there are 2 memory leaks in the example above: fix it or change
///   signature of the methods to return managed pointers instead of raw ones
///   (unique_ptr, shared_ptr, etc.)
class KUDU_EXPORT KuduTableAlterer {
 public:
  ~KuduTableAlterer();

  /// Create an object to perform table renaming operation
  ///
  /// @param [in] new_name
  ///   The new name for the table
  /// @return Raw pointer to this alterer object
  KuduTableAlterer* RenameTo(const std::string& new_name);

  /// Add a new column to the table
  ///
  /// When adding a column, you must specify the default value of the new
  /// column using @see KuduColumnSpec::DefaultValue(...).
  ///
  /// @param name
  ///   Name of the column do add
  /// @return Pointer to the result ColumnSpec object.  The caller
  ///   is responsible for freeing the object.
  KuduColumnSpec* AddColumn(const std::string& name);

  /// Alter an existing column
  ///
  /// @param [in] name
  ///   Name of the column to alter
  /// @return Pointer to the result ColumnSpec object.  The caller
  ///   is responsible for freeing the object.
  KuduColumnSpec* AlterColumn(const std::string& name);

  /// Drops an existing column from the table
  ///
  /// @param [in] name
  ///   Name of the column to alter
  /// @return Raw pointer to this alterer object
  KuduTableAlterer* DropColumn(const std::string& name);

  /// Set timeout for the alteration operation
  ///
  /// This includes any waiting after the alter has been submitted
  /// (i.e if the alter is slow to be performed on a large table,
  ///  it may time out and then later be successful).
  ///
  /// @param [in] timeout
  ///   Timeout to set
  /// @return Raw pointer to this alterer object
  KuduTableAlterer* timeout(const MonoDelta& timeout);

  /// Whether to wait for completion of alteration operations
  ///
  /// If set to true, every alteration operation returns control only after
  /// the opration is complete.  Otherwise, every operation returns immediately.
  /// By default (i.e. when an alteration object is created) it is set to true.
  ///
  /// @param [in] wait
  ///   Whether to wait for any alteration operation to complete before
  ///   returning control
  /// @return Raw pointer to this alterer object
  KuduTableAlterer* wait(bool wait);

  /// Alter the table
  ///
  /// @return The return value may indicate an error in the alter operation,
  ///   or a misuse of the builder (e.g. @see add_column() with
  ///   default_value=NULL).  In the latter case, only the last error
  ///   is returned.
  Status Alter();

 private:
  class KUDU_NO_EXPORT Data;
  friend class KuduClient;

  KuduTableAlterer(KuduClient* client,
                   const std::string& name);

  // Owned.
  Data* data_;

  DISALLOW_COPY_AND_ASSIGN(KuduTableAlterer);
};

/// This class represents an error which occurred in a given operation.
///
/// Using an instance of this class, it's possible to track error details
/// such as the operation which caused the error, along with whatever
/// the actual error was.
class KUDU_EXPORT KuduError {
 public:
  ~KuduError();

  /// @return The actual error which occurred
  const Status& status() const;

  /// @return The operation which failed
  const KuduWriteOperation& failed_op() const;

  /// Release the operation that failed: must only be called only once.
  ///
  /// @return Raw pointer to write operation object.  The caller
  ///   takes ownership of the returned object.
  KuduWriteOperation* release_failed_op();

  /// Check if there is a chance that the requested operation was successful
  ///
  /// In some cases, it's possible that the server did receive and successfully
  /// perform the requested operation, but the client can't tell whether or not
  /// it was successful. For example, if the call times out, the server may
  /// still succeed in processing at a later time.
  ///
  /// @return This function returns @c true if there is some chance that
  ///   the server did process the operation, and @c false if it can guarantee
  ///   that the operation did not succeed.
  bool was_possibly_successful() const;

 private:
  class KUDU_NO_EXPORT Data;

  friend class internal::Batcher;
  friend class KuduSession;

  KuduError(KuduWriteOperation* failed_op, const Status& error);

  // Owned.
  Data* data_;

  DISALLOW_COPY_AND_ASSIGN(KuduError);
};


/// Representation of a Kudu client session.
///
/// A KuduSession belongs to a specific KuduClient, and represents a context in
/// which all read/write data access should take place.  Within a session,
/// multiple operations may be accumulated and batched together for better
/// efficiency.  Settings like timeouts, priorities, and trace IDs are also set
/// per session.
///
/// A KuduSession's main purpose is for grouping together multiple data-access
/// operations together into batches or transactions.  It is important to note
/// the distinction between these two:
///
/// @li A batch is a set of operations which are grouped together in order to
///   amortize fixed costs such as RPC call overhead and round trip times.
///   A batch DOES NOT imply any ACID-like guarantees.  Within a batch, some
///   operations may succeed while others fail, and concurrent readers may see
///   partial results.  If the client crashes mid-batch, it is possible that
///   some of the operations will be made durable while others were lost.
/// @li In contrast, a transaction is a set of operations which are treated
///   as an indivisible semantic unit, per the usual definitions of database
///   transactions and isolation levels.
///
/// @note Kudu does not currently support transactions!  They are only mentioned
///   in the above documentation to clarify that batches are not transactional
///   and should only be used for efficiency.
///
/// KuduSession is separate from KuduTable because a given batch or transaction
/// may span multiple tables.  This is particularly important in the future when
/// we add ACID support, but even in the context of batching, we may be able to
/// coalesce writes to different tables hosted on the same server into the same
/// RPC.
///
/// KuduSession is separate from KuduClient because, in a multi-threaded
/// application, different threads may need to concurrently execute
/// transactions.  Similar to a JDBC "session", transaction boundaries will be
/// delineated on a per-session basis -- in between a "BeginTransaction" and
/// "Commit" call on a given session, all operations will be part of the same
/// transaction.  Meanwhile another concurrent Session object can safely run
/// non-transactional work or other transactions without interfering.
///
/// Additionally, there is a guarantee that writes from different sessions
/// do not get batched together into the same RPCs -- this means that
/// latency-sensitive clients can run through the same KuduClient object
/// as throughput-oriented clients, perhaps by setting the latency-sensitive
/// session's timeouts low and priorities high.  Without the separation
/// of batches, a latency-sensitive single-row insert might get batched along
/// with 10MB worth of inserts from the batch writer, thus delaying
/// the response significantly.
///
/// Though we currently do not have transactional support, users will be forced
/// to use a KuduSession to instantiate reads as well as writes.  This will make
/// it more straight-forward to add RW transactions in the future without
/// significant modifications to the API.
///
/// Users who are familiar with the Hibernate ORM framework should find this
/// concept of a Session familiar.
///
/// @note This class is not thread-safe except where otherwise specified.
class KUDU_EXPORT KuduSession : public sp::enable_shared_from_this<KuduSession> {
 public:
  ~KuduSession();

  /// Modes of operation flush
  enum FlushMode {
    /// Every write will be sent to the server in-band with the Apply()
    /// call.  No batching will occur. This is the default flush mode.  In this
    /// mode, the Flush() call never has any effect, since each Apply() call
    /// has already flushed the buffer.  This is the default for a session.
    AUTO_FLUSH_SYNC,

    /// Apply() calls will return immediately, but the writes will be sent
    /// in the background, potentially batched together with other writes
    /// from the same session.  If there is not sufficient buffer space,
    /// then Apply() will block for buffer space to be available.
    ///
    /// Because writes are applied in the background, any errors will be stored
    /// in a session-local buffer.  Call CountPendingErrors() or
    /// GetPendingErrors() to retrieve them.
    ///
    /// The Flush() call can be used to block until the buffer is empty.
    ///
    /// @todo Provide an API for the user to specify a callback to do their own
    ///   error reporting.
    ///
    /// @todo Specify which threads the background activity runs on
    ///   (probably the messenger IO threads?).
    ///
    /// @note This is not implemented yet, see KUDU-456
    AUTO_FLUSH_BACKGROUND,

    /// Apply() calls will return immediately, and the writes will not be
    /// sent until the user calls Flush().  If the buffer runs past the
    /// configured space limit, then Apply() will return an error.
    MANUAL_FLUSH
  };

  /// Set the flush mode
  ///
  /// REQUIRES: there should be no pending writes -- call Flush() first
  /// to ensure nothis is pending.
  ///
  /// @param [in] m
  ///   Flush mode to set
  /// @return Operation status
  Status SetFlushMode(FlushMode m) WARN_UNUSED_RESULT;

  /// The possible external consistency modes on which Kudu operates
  enum ExternalConsistencyMode {
    /// This is the default mode.
    ///
    /// The response to any write will contain a timestamp.  Any further calls
    /// from the same client to other servers will update those servers
    /// with that timestamp.  Following write operations from the same client
    /// will be assigned timestamps that are strictly higher, enforcing external
    /// consistency without having to wait or incur any latency penalties.
    ///
    /// In order to maintain external consistency for writes between
    /// two different clients in this mode, the user must forward the timestamp
    /// from the first client to the second by using
    /// KuduClient::GetLatestObservedTimestamp() and
    /// KuduClient::SetLatestObservedTimestamp().
    ///
    /// @warning
    ///   Failure to propagate timestamp information through back-channels
    ///   between two different clients will negate any external consistency
    ///   guarantee under this mode.
    CLIENT_PROPAGATED,

    /// The server will guarantee that write operations from the same or from
    /// other client are externally consistent, without the need to propagate
    /// timestamps across clients.  This is done by making write operations
    /// wait until there is certainty that all follow up write operations
    /// (operations that start after the previous one finishes)
    /// will be assigned a timestamp that is strictly higher, enforcing external
    /// consistency.
    ///
    /// @warning
    ///   Depending on the clock synchronization state of TabletServers this may
    ///   imply considerable latency.  Moreover operations in @c COMMIT_WAIT
    ///   external consistency mode will outright fail if TabletServer clocks
    ///   are either unsynchronized or synchronized but with a maximum error
    ///   which surpasses a pre-configured threshold.
    COMMIT_WAIT
  };

  /// Set external consistency mode for the session.
  ///
  /// @return Operation result status
  Status SetExternalConsistencyMode(ExternalConsistencyMode m) WARN_UNUSED_RESULT;

  /// Set the amount of buffer space used by this session for outbound writes
  ///
  /// The effect of the buffer size varies based on the flush mode of
  /// the session:
  /// @li AUTO_FLUSH_SYNC
  ///   since no buffering is done, this has no effect
  /// @li AUTO_FLUSH_BACKGROUND
  ///   if the buffer space is exhausted, then write calls will block until
  ///   there is space available in the buffer
  /// @li MANUAL_FLUSH
  ///   if the buffer space is exhausted, then write calls will return an error
  ///
  /// @param [in] size
  ///   Size of the buffer space to set (@todo specify the unit of measurement)
  /// @return Operation result status
  Status SetMutationBufferSpace(size_t size) WARN_UNUSED_RESULT;

  /// Set the timeout for writes made in this session
  ///
  /// @param [in] millis
  ///   Timeout to set in milliseconds; should be greater than 0.
  void SetTimeoutMillis(int millis);

  /// @todo
  ///   Add "doAs" ability here for proxy servers to be able to act on behalf of
  ///   other users, assuming access rights.

  /// Apply the write operation.
  ///
  /// The behavior of this function depends on the current flush mode.
  /// Regardless of flush mode, however, Apply() may begin to perform processing
  /// in the background for the call (e.g. looking up the tablet, etc).
  /// Given that, an error may be queued into the PendingErrors structure prior
  /// to flushing, even in @c MANUAL_FLUSH mode.
  ///
  /// In case of any error, which may occur during flushing or because
  /// the write_op is malformed, the write_op is stored in the session's error
  /// collector which may be retrieved at any time.
  ///
  /// @note This method is thread safe.
  ///
  /// @param [in] write_op
  ///   Operation to apply.  This method transfers the write_op's ownership
  ///   to the KuduSession.
  /// @return Operation result status
  Status Apply(KuduWriteOperation* write_op) WARN_UNUSED_RESULT;

  /// Apply the write operation in asynchronous mode.
  ///
  /// This method is similar to Apply(), except it never blocks.  Even in the
  /// flush modes that return immediately, 'cb' is triggered with the result.
  /// The callback may be called by a reactor thread, or in some cases
  /// may be called inline by the same thread which calls ApplyAsync().
  ///
  /// @param [in] cb
  ///   Callback to report the status of the operation.  The @c cb object
  ///   must remain valid until it's called.
  ///
  /// @note not yet implemented
  void ApplyAsync(KuduWriteOperation* write_op, KuduStatusCallback* cb);

  /// Flush any pending writes.
  ///
  /// Returns a bad status if there are any pending errors after the rows have
  /// been flushed. Callers should then use GetPendingErrors to determine which
  /// specific operations failed.
  ///
  /// In @c AUTO_FLUSH_SYNC mode, this has no effect, since every Apply() call
  /// flushes itself inline.
  ///
  /// In the case that the async version of this method is used, then
  /// the callback will be called upon completion of the operations which
  /// were buffered since the last flush.  In other words, in the following
  /// sequence:
  /// @code
  ///   session->Insert(a);
  ///   session->FlushAsync(callback_1);
  ///   session->Insert(b);
  ///   session->FlushAsync(callback_2);
  /// @endcode
  /// ... @c callback_2 will be triggered once @c b has been inserted,
  /// regardless of whether @c a has completed or not.
  ///
  /// @note This also means that, if FlushAsync is called twice in succession,
  /// with no intervening operations, the second flush will return immediately.
  /// For example:
  /// @code
  ///   session->Insert(a);
  ///   session->FlushAsync(callback_1); // called when 'a' is inserted
  ///   session->FlushAsync(callback_2); // called immediately!
  /// @endcode
  /// Note that, as in all other async functions in Kudu, the callback
  /// may be called either from an IO thread or the same thread which calls
  /// FlushAsync.  The callback should not block.
  ///
  /// @note This function is thread-safe.
  ///
  /// @return Operation result status
  Status Flush() WARN_UNUSED_RESULT;

  /// Flush any pending writes: the async version.
  /// @copydoc KuduSession::Flush()
  /// @param [in] cb
  ///   Callback to report on flush results. The @c cb must remain valid until
  ///   it is invoked.
  void FlushAsync(KuduStatusCallback* cb);

  /// Close the session.
  ///
  /// @return Operation result status.  Particularly, an error is returned
  ///   if there are unflushed or in-flight operations.
  Status Close() WARN_UNUSED_RESULT;

  /// Check if there are any pending operations in this session.
  ///
  /// @todo Maybe "incomplete" or "undelivered" is clearer?
  ///
  /// @note This function is thread-safe.
  ///
  /// @return @c true if there are operations which have not yet been delivered
  ///   to the cluster.  This may include buffered operations
  ///   (i.e. those that have not yet been flushed) as well as in-flight
  ///   operations
  ///   (i.e. those that are in the process of being sent to the servers).
  bool HasPendingOperations() const;

  /// Get number of buffered operations (not the same as 'pending').
  ///
  /// Note that this is different than HasPendingOperations() above,
  /// which includes operations which have been sent and not yet responded to.
  /// This is only relevant in @c MANUAL_FLUSH mode, where the result will not
  /// decrease except for after a manual flush, after which point it will be 0.
  /// In the other flush modes, data is immediately put en-route
  /// to the destination, so this will return 0.
  ///
  /// @note This function is thread-safe.
  ///
  /// @return The number of buffered operations.  These are operations that have
  ///   not yet been flushed -- i.e. they are not en-route yet.
  int CountBufferedOperations() const;

  /// Get error count for pending operations.
  ///
  /// Errors may accumulate when using the @c AUTO_FLUSH_BACKGROUND mode.
  ///
  /// @note This function is thread-safe.
  ///
  /// @return Total count of errors accumulated during the session.
  int CountPendingErrors() const;

  /// Get information on errors from previous calls.
  ///
  /// @param [out] errors
  ///   Pointer to the container to full with error info objects.  Caller takes
  ///   ownership of the returned errors in the container.
  /// @param [out] overflowed
  ///   If there were more errors than could be held in the session's error
  ///   storage, then @c overflowed is set to @c true.
  ///
  /// @note This function is thread-safe.
  void GetPendingErrors(std::vector<KuduError*>* errors, bool* overflowed);

  /// Get client for the session.
  ///
  /// @return Pointer to the client object.
  KuduClient* client() const;

 private:
  class KUDU_NO_EXPORT Data;

  friend class KuduClient;
  friend class internal::Batcher;
  explicit KuduSession(const sp::shared_ptr<KuduClient>& client);

  // Owned.
  Data* data_;

  DISALLOW_COPY_AND_ASSIGN(KuduSession);
};


/// This class represents a single scanner.
///
/// @note This class is not thread-safe, though different scanners on different
///   threads may share a single KuduTable object.
class KUDU_EXPORT KuduScanner {
 public:
  /// Read modes for scanners.
  enum ReadMode {
    /// When @c READ_LATEST is specified the server will always return committed
    /// writes at the time the request was received.  This type of read does not
    /// return a snapshot timestamp and is not repeatable.
    ///
    /// In ACID terms this corresponds to Isolation mode: "Read Committed"
    ///
    /// This is the default mode.
    READ_LATEST,

    /// When @c READ_AT_SNAPSHOT is specified the server will attempt to perform
    /// a read at the provided timestamp.  If no timestamp is provided
    /// the server will take the current time as the snapshot timestamp.
    /// In this mode reads are repeatable, i.e. all future reads at the same
    /// timestamp will yield the same data.  This is performed at the expense
    /// of waiting for in-flight transactions whose timestamp is lower than
    /// the snapshot's timestamp to complete, so it might incur
    /// a latency penalty.
    ///
    /// In ACID terms this, by itself, corresponds to Isolation mode "Repeatable
    /// Read".  If all writes to the scanned tablet are made externally
    /// consistent, then this corresponds to Isolation mode
    /// "Strict-Serializable".
    ///
    /// @note There are currently "holes", which happen in rare edge conditions,
    ///   by which writes are sometimes not externally consistent even when
    ///   action was taken to make them so.  In these cases Isolation may
    ///   degenerate to mode "Read Committed".  See KUDU-430.
    READ_AT_SNAPSHOT
  };

  /// Whether the rows should be returned in order.
  ///
  /// This affects the fault-tolerance properties of a scanner.
  enum OrderMode {
    /// Rows will be returned in an arbitrary order determined by the tablet
    /// server.  This is efficient, but unordered scans are not fault-tolerant
    /// and cannot be resumed in the case of tablet server failure.
    ///
    /// This is the default mode.
    UNORDERED,

    /// Rows will be returned ordered by primary key.  Sorting the rows imposes
    /// additional overhead on the tablet server, but means that scans are
    /// fault-tolerant and will be resumed at another tablet server
    /// in the case of a failure.
    ORDERED
  };

  /// Default scanner timeout.
  /// This is set to 3x the default RPC timeout:
  /// @see KuduClientBuilder::default_rpc_timeout()).
  enum { kScanTimeoutMillis = 30000 };

  /// Constructor -- it initializes the scanner upon construction.
  ///
  /// @param [in] table
  ///   The table to perfrom scan.  The given object must remain valid
  ///   for the lifetime of this scanner object.
  ///
  /// @todo Should table be a const pointer?
  explicit KuduScanner(KuduTable* table);
  ~KuduScanner();

  /// Set the projection for the scanner using column names.
  ///
  /// Set the projection used for the scanner by passing column names to read.
  /// This overrides any previous call to SetProjectedColumnNames() or
  /// SetProjectedColumnIndexes().
  ///
  /// @param [in] col_names
  ///   Column names to use for the projection.
  /// @return Operation result status
  Status SetProjectedColumnNames(const std::vector<std::string>& col_names)
    WARN_UNUSED_RESULT;

  /// Set the projection for the scanner using column indexes.
  ///
  /// Set the projection used for this scanner by passing the column indexes
  /// to read.
  /// This overrides any previous call to SetProjectedColumnNames() or
  /// SetProjectedColumnIndexes().
  ///
  /// @param [in] col_indexes
  ///   Column indices to use for the projection.
  /// @return Operation result status
  Status SetProjectedColumnIndexes(const std::vector<int>& col_indexes)
    WARN_UNUSED_RESULT;

  /// @deprecated User SetProjectedColumnNames() instead
  /// @return Operation result status
  Status SetProjectedColumns(const std::vector<std::string>& col_names)
    WARN_UNUSED_RESULT;

  /// @copydoc KuduScanTokenBuilder::AddConjunctPredicate()
  Status AddConjunctPredicate(KuduPredicate* pred) WARN_UNUSED_RESULT;

  /// @copydoc KuduScanTokenBuilder::AddLowerBound()
  Status AddLowerBound(const KuduPartialRow& key);

  /// Add lower bound
  ///
  /// @deprecated Use AddLowerBound() instead.
  ///
  /// @param [in] key
  ///   The primary key to use as an opaque slice of data.
  /// @return Operation result status
  Status AddLowerBoundRaw(const Slice& key);

  /// Add an upper bound (exclusive) primary key for the scan.
  ///
  /// If any bound is already added, this bound is intersected with that one.
  ///
  /// @param [in] key
  ///   The key to setup the upper bound.  The scanner makes a copy of the
  ///   parameter, the caller may free it afterward.
  /// @return Operation result status
  Status AddExclusiveUpperBound(const KuduPartialRow& key);

  /// Add an upper bound (exclusive) primary key for the scan.
  ///
  /// @deprecated Use AddExclusiveUpperBound() instead.
  ///
  /// @param [in] key
  ///   The encoded primary key is an opaque slice of data
  /// @return Operation result status
  Status AddExclusiveUpperBoundRaw(const Slice& key);

  /// Add a lower bound (inclusive) partition key for the scan.
  ///
  /// @note This method is unstable, and for internal use only.
  ///
  /// @param [in] partition_key
  ///   The scanner makes a copy of the parameter: the caller may invalidate
  ///   it afterward.
  /// @return Operation result status
  Status AddLowerBoundPartitionKeyRaw(const Slice& partition_key);

  /// Add an upper bound (exclusive) partition key for the scan.
  ///
  /// @note This method is unstable, and for internal use only.
  ///
  /// @param [in] partition_key
  ///   The scanner makes a copy of the parameter: the caller may invalidate
  ///   it afterward.
  /// @return Operation result status
  Status AddExclusiveUpperBoundPartitionKeyRaw(const Slice& partition_key);

  /// @copydoc KuduScanTokenBuilder::SetCacheBlocks
  Status SetCacheBlocks(bool cache_blocks);

  /// Begin scanning.
  ///
  /// @return Operation result status
  Status Open();

  /// Keep the current remote scanner alive.
  ///
  /// Keep the current remote scanner alive on the Tablet server
  /// for an additional time-to-live (set by a configuration flag on
  /// the tablet server).  This is useful if the interval in between
  /// NextBatch() calls is big enough that the remote scanner might be garbage
  /// collected (default ttl is set to 60 secs.).
  /// This does not invalidate any previously fetched results.
  ///
  /// @return Operation result status.  Particularly, this method returns
  ///   a non-OK status if the scanner was already garbage collected or if the
  ///   TabletServer was unreachable, for any reason.  Note that a non-OK
  ///   status returned by this method should not be taken as indication
  ///   that the scan has failed.  Subsequent calls to NextBatch() might
  ///   still be successful, particularly if SetFaultTolerant() has been called.
  Status KeepAlive();

  /// Close the scanner.
  ///
  /// Closing scanner releases resources on the server.  This call does not
  /// block, and will not ever fail, even if the server cannot be contacted.
  ///
  /// @note The scanner is reset to its initial state by this function.
  ///   You'll have to re-add any projection, predicates, etc if you want
  ///   to reuse this Scanner object.
  void Close();

  /// Check if there may be rows to be fetched from this scanner.
  /// @return @c true if there may be rows to be fetched from this scanner.
  ///   The method returns @c true provided there's at least one more tablet
  ///   left to scan, even if that tablet has no data
  ///   (we'll only know once we scan it).
  ///   It will also be @c true after the initially opening the scanner before
  ///   NextBatch is called for the first time.
  bool HasMoreRows() const;

  /// Get next batch for rows.
  ///
  /// Clears 'rows' and populates it with the next batch of rows
  /// from the tablet server.  A call to NextBatch() invalidates all previously
  /// fetched results which might now be pointing to garbage memory.
  ///
  /// @deprecated Use NextBatch(KuduScanBatch*) instead.
  ///
  /// @param [out] rows
  ///   Placeholder for the result.
  /// @return Operation result status
  Status NextBatch(std::vector<KuduRowResult>* rows);

  /// Fetch the next batch of results for this scanner.
  ///
  /// A single KuduScanBatch instance may be reused.  Each subsequent call
  /// replaces the data from the previous call, and invalidates any
  /// KuduScanBatch::RowPtr objects previously obtained from the batch.
  /// @param [out] batch
  ///   The placeholder for the result.
  /// @return Operation result status
  Status NextBatch(KuduScanBatch* batch);

  /// Get the KuduTabletServer that is currently handling the scan.
  ///
  /// More concretely, this is the server that handled the most recent
  /// Open() or NextBatch() RPC made by the server.
  ///
  /// @param [out] server
  ///   Placeholder for the result.
  /// @return Operation result status
  Status GetCurrentServer(KuduTabletServer** server);

  /// @return the cumulative resource metrics since the scan was started.
  const ResourceMetrics& GetResourceMetrics() const;

  /// Set the hint for the size of the next batch in bytes.
  ///
  /// @param [in] batch_size
  ///   The hint of batch size to set.  If setting to 0 before calling Open(),
  ///   it means that the first call to the tablet server won't return data.
  /// @return Operation result status
  Status SetBatchSizeBytes(uint32_t batch_size);

  /// Set the replica selection policy while scanning.
  ///
  /// @todo Kill this method in favor of a consistency-level-based API
  /// @param [in] selection
  ///   The policy to set.
  /// @return Operation result status
  Status SetSelection(KuduClient::ReplicaSelection selection)
    WARN_UNUSED_RESULT;

  /// Set the ReadMode.  Default is @c READ_LATEST.
  /// @return Operation result status
  Status SetReadMode(ReadMode read_mode) WARN_UNUSED_RESULT;

  /// @deprecated Use SetFaultTolerant() instead.
  /// @return Operation result status
  Status SetOrderMode(OrderMode order_mode) WARN_UNUSED_RESULT;

  /// @copydoc KuduScanTokenBuilder::SetFaultTolerant
  Status SetFaultTolerant() WARN_UNUSED_RESULT;

  /// @copydoc KuduScanTokenBuilder::SetSnapshotMicros
  Status SetSnapshotMicros(uint64_t snapshot_timestamp_micros) WARN_UNUSED_RESULT;

  /// @copydoc KuduScanTokenBuilder::SetSnapshotRaw
  Status SetSnapshotRaw(uint64_t snapshot_timestamp) WARN_UNUSED_RESULT;

  /// @copydoc KuduScanTokenBuilder::SetTimeoutMillis
  Status SetTimeoutMillis(int millis);

  /// @return The schema of the projection being scanned.
  KuduSchema GetProjectionSchema() const;

  /// @return String representation of this scan.
  std::string ToString() const;

 private:
  class KUDU_NO_EXPORT Data;

  friend class KuduScanToken;
  FRIEND_TEST(ClientTest, TestScanCloseProxy);
  FRIEND_TEST(ClientTest, TestScanFaultTolerance);
  FRIEND_TEST(ClientTest, TestScanNoBlockCaching);
  FRIEND_TEST(ClientTest, TestScanTimeout);

  // Owned.
  Data* data_;

  DISALLOW_COPY_AND_ASSIGN(KuduScanner);
};

/// A KuduScanToken describes a partial scan of a Kudu table limited to a single
/// contiguous physical location.  Using the KuduScanTokenBuilder, clients can
/// describe the desired scan, including predicates, bounds, timestamps, and
/// caching, and receive back a collection of scan tokens.
///
/// Each scan token may be separately turned into a scanner using
/// KuduScanToken::IntoKuduScanner, with each scanner responsible for a disjoint
/// section of the table.
///
/// Scan tokens may be serialized using the KuduScanToken::Serialize method and
/// deserialized back into a scanner using the
/// KuduScanToken::DeserializeIntoScanner method.  This allows use cases such as
/// generating scan tokens in the planner component of a query engine, then
/// sending the tokens to execution nodes based on locality, and then
/// instantiating the scanners on those nodes.
///
/// Scan token locality information can be inspected using the
/// KuduScanToken::TabletServers() method.
class KUDU_EXPORT KuduScanToken {
 public:

  ~KuduScanToken();

  /// Create a new scanner
  ///
  /// This method creates a new scanner, setting the result scanner's options
  /// according to the scan token.
  ///
  /// @param [out] scanner
  ///   The result scanner.  The caller owns the new scanner.  The scanner
  ///   must be opened before use.  The output parameter will not be set
  ///   if the returned status is an error.
  /// @return Operation result status
  Status IntoKuduScanner(KuduScanner** scanner) const WARN_UNUSED_RESULT;

  /// Get hint on candidate servers which may be hosting the source tablet.
  ///
  /// This method should be considered a hint, not a definitive answer,
  /// since tablet to tablet server assignments may change in response to
  /// external events such as failover or load balancing.
  ///
  /// @todo Is it worth keeping this method if outputs unreliable information?
  ///
  /// @return Vector of tablet servers who may be hosting the tablet which
  ///   this scan is retrieving rows from.
  const std::vector<KuduTabletServer*>& TabletServers() const;

  /// Serialize the token into a string.
  ///
  /// Deserialize with KuduScanToken::DeserializeIntoScanner().
  ///
  /// @param [out] buf
  ///   Result string to output the serialized token.
  /// @return Operation result status
  Status Serialize(std::string* buf) const WARN_UNUSED_RESULT;

  /// Create a new scanner and set the scanner options.
  ///
  /// @param [in] client
  ///   Client to bound the scanner to.
  /// @param [in] serialized_token
  ///   Token to pass the parameters in serialized form.
  /// @param [out] scanner
  ///   The result scanner.  The caller owns the new scanner.  The scanner
  ///   must be opened before use.  The scanner will not be set if
  ///   the returned status is an error.
  /// @return Operation result status
  static Status DeserializeIntoScanner(KuduClient* client,
                                       const std::string& serialized_token,
                                       KuduScanner** scanner) WARN_UNUSED_RESULT;

 private:
  class KUDU_NO_EXPORT Data;

  friend class KuduScanTokenBuilder;

  explicit KuduScanToken(Data* data);

  // Owned.
  Data* data_;

  DISALLOW_COPY_AND_ASSIGN(KuduScanToken);
};

/// Scan token builder for a table.
///
/// @note This class is not thread-safe.
class KUDU_EXPORT KuduScanTokenBuilder {
 public:

  /// Construct an instance of the class.
  ///
  /// @param [in] table
  ///   The table to work with.  The given object must remain valid
  ///   for the lifetime of the builder, and the tokens which it builds.
  explicit KuduScanTokenBuilder(KuduTable* table);
  ~KuduScanTokenBuilder();

  /// Set the column projection by passing the column names to read.
  ///
  /// Set the column projection used for this scanner by passing the column
  /// names to read.  A call of this method overrides any previous call to
  /// SetProjectedColumnNames() or SetProjectedColumnIndexes().
  ///
  /// @param [in] col_names
  ///   Column names for the projection
  /// @return Operation result status
  Status SetProjectedColumnNames(const std::vector<std::string>& col_names)
    WARN_UNUSED_RESULT;

  /// Set the column projection by passing the column indexes to read.
  ///
  /// Set the column projection used for this scanner by passing the column
  /// indices to read.  A call this method overrides any previous call to
  /// SetProjectedColumnNames() or SetProjectedColumnIndexes().
  ///
  /// @param [in] col_names
  ///   Column indices for the projection
  /// @return Operation result status
  Status SetProjectedColumnIndexes(const std::vector<int>& col_indexes)
    WARN_UNUSED_RESULT;

  /// Add a predicate for the scan.
  ///
  /// @param [in] pred
  ///   Predicate to set.  The KuduScanTokenBuilder instance takes ownership
  ///   of the parameter even if a bad Status is returned.  Multiple calls
  ///   of this method make the specified set of predicates to work in
  ///   conjunction -- i.e. all predicates must pass for a row to be returned.
  /// @return Operation result status
  Status AddConjunctPredicate(KuduPredicate* pred) WARN_UNUSED_RESULT;

  /// Add a lower bound (inclusive) primary key for the scan.
  ///
  /// If any bound is already added, this bound is intersected with that one.
  ///
  /// @param [in] key
  ///   Lower bound primary key to add.  The KuduScanTokenBuilder instance
  ///   does not take ownership of the parameter.
  /// @return Operation result status
  Status AddLowerBound(const KuduPartialRow& key) WARN_UNUSED_RESULT;

  /// Add an upper bound (exclusive) primary key.
  ///
  /// If any bound is already added, this bound is intersected with that one.
  ///
  /// @param [in] key
  ///   Upper bound primary key to add.  The KuduScanTokenBuilder instance
  ///   does not take ownership of the parameter.
  /// @return Operation result status
  Status AddUpperBound(const KuduPartialRow& key) WARN_UNUSED_RESULT;

  /// Set the block caching policy.
  ///
  /// @param [in] cache_blocks
  ///   If @c true, scanned data blocks will be cached in memory and
  ///   made available for future scans. Default is @c true.
  /// @return Operation result status
  Status SetCacheBlocks(bool cache_blocks) WARN_UNUSED_RESULT;

  /// Set the hint for the size of the next batch in bytes.
  /// If set to 0, the first call to the tablet server won't return data.
  /// @return Operation result status
  Status SetBatchSizeBytes(uint32_t batch_size) WARN_UNUSED_RESULT;

  /// Set the replica selection policy while scanning.
  ///
  /// @todo Kill this in favor of a consistency-level-based API
  /// @param [in] selection
  ///   Selection policy to set
  /// @return Operation result status
  Status SetSelection(KuduClient::ReplicaSelection selection)
    WARN_UNUSED_RESULT;

  /// Set the ReadMode.  Default is @c READ_LATEST.
  ///
  /// @param [in] read_mode
  ///   Read mode to set
  /// @return Operation result status
  Status SetReadMode(KuduScanner::ReadMode read_mode) WARN_UNUSED_RESULT;

  /// Make scans resumable at another tablet server if current one fails.
  ///
  /// Scans are by default non fault-tolerant, and scans will fail
  /// if scanning an individual tablet fails (for example, if a tablet server
  /// crashes in the middle of a tablet scan).  If this method is called,
  /// scans will be resumed at another tablet server in the case of failure.
  ///
  /// Fault-tolerant scans typically have lower throughput than non
  /// fault-tolerant scans.  Fault tolerant scans use @c READ_AT_SNAPSHOT mode:
  /// if no snapshot timestamp is provided, the server will pick one.
  ///
  /// @return Operation result status
  Status SetFaultTolerant() WARN_UNUSED_RESULT;

  /// Set snapshot timestamp for scans in @c READ_AT_SNAPSHOT mode.
  ///
  /// @param [in] snapshot_timestamp_micros
  ///   Timestamp to set in in microseconds since the Epoch.
  /// @return Operation result status
  Status SetSnapshotMicros(uint64_t snapshot_timestamp_micros)
    WARN_UNUSED_RESULT;

  /// Set snapshot timestamp for scans in @c READ_AT_SNAPSHOT mode (raw).
  ///
  /// @param [in] snapshot_timestamp
  ///   Timestamp to set in raw encoded form
  ///   (i.e. as returned by a previous call to a server).
  /// @return Operation result status
  Status SetSnapshotRaw(uint64_t snapshot_timestamp) WARN_UNUSED_RESULT;

  /// Set the maximum time that Open() and NextBatch() are allowed to take.
  ///
  /// @param [in] millis
  ///   Timeout to set (in milliseconds).  Must be greater than 0.
  /// @return Operation result status
  Status SetTimeoutMillis(int millis) WARN_UNUSED_RESULT;

  /// Build the set of scan tokens.
  ///
  /// The builder may be reused after this call.
  ///
  /// @param [out] tokens
  ///   Result set of tokens.  The caller takes ownership of the container
  ///   elements.
  /// @return Operation result status
  Status Build(std::vector<KuduScanToken*>* tokens) WARN_UNUSED_RESULT;

  /// @return A string representation of this scan.
  std::string ToString() const;

 private:
  class KUDU_NO_EXPORT Data;

  // Owned.
  Data* data_;

  DISALLOW_COPY_AND_ASSIGN(KuduScanTokenBuilder);
};

/// In-memory representation of a remote tablet server.
class KUDU_EXPORT KuduTabletServer {
 public:
  ~KuduTabletServer();

  /// Get the UUID of the tablet server.
  ///
  /// @return The UUID which is globally unique and guaranteed not to change
  ///   for the lifetime of the tablet server.
  const std::string& uuid() const;

  /// Get RPC address for the tablet server
  ///
  /// @return Hostname of the first RPC address that this tablet server
  ///   is listening on.
  const std::string& hostname() const;

 private:
  class KUDU_NO_EXPORT Data;

  friend class KuduClient;
  friend class KuduScanner;
  friend class KuduScanTokenBuilder;

  KuduTabletServer();

  // Owned.
  Data* data_;

  DISALLOW_COPY_AND_ASSIGN(KuduTabletServer);
};

} // namespace client
} // namespace kudu
#endif

Re: Proposal: doxygen-style comments for C++ client API (and may be other C++ components?)

Reply via email to