Re: [HACKERS] Write Ahead Logging for Hash Indexes

Mark Kirkwood Thu, 08 Sep 2016 19:51:43 -0700

On 09/09/16 07:09, Jeff Janes wrote:

On Wed, Sep 7, 2016 at 3:29 AM, Ashutosh Sharma <ashu.coe...@gmail.com> wrote:
    > Thanks to Ashutosh Sharma for doing the testing of the patch and
    > helping me in analyzing some of the above issues.

    Hi All,

    I would like to summarize the test cases that I have executed for
    validating the WAL logging in hash index feature.

    1) I have mainly run the pgbench test with a read-write workload at a
    scale factor of 1000 and various client counts (16, 64 and 128) for
    durations of 30 mins, 1 hr and 24 hrs. I executed this test on a
    highly configured power2 machine with 128 cores and 512GB of RAM. I
    ran the test case both with and without the replication setup.

    Please note that I have changed the schema of the pgbench tables
    created during the initialisation phase.

    The new schema of pgbench tables looks as shown below on both master
    and standby:

    postgres=# \d pgbench_accounts
       Table "public.pgbench_accounts"
      Column  |     Type      | Modifiers
    ----------+---------------+-----------
     aid      | integer       | not null
     bid      | integer       |
     abalance | integer       |
     filler   | character(84) |
    Indexes:
        "pgbench_accounts_pkey" PRIMARY KEY, btree (aid)
        "pgbench_accounts_bid" hash (bid)

    postgres=# \d pgbench_history
              Table "public.pgbench_history"
     Column |            Type             | Modifiers
    --------+-----------------------------+-----------
     tid    | integer                     |
     bid    | integer                     |
     aid    | integer                     |
     delta  | integer                     |
     mtime  | timestamp without time zone |
     filler | character(22)               |
    Indexes:
        "pgbench_history_bid" hash (bid)


Hi Ashutosh,
This schema will test the maintenance of hash indexes, but it will never use hash indexes for searching, so it limits the amount of test coverage you will get. While searching shouldn't generate novel types of WAL records (that I know of), it will generate locking and timing issues that might uncover bugs (if there are any left to uncover, of course).
I would drop the primary key on pgbench_accounts and replace it with a hash index and test it that way (except I don't have a 128 core machine at my disposal, so really I am suggesting that you do this...)
The lack of primary key and the non-uniqueness of the hash index should not be an operational problem, because the built in pgbench runs never attempt to violate the constraints anyway.
In fact, I'd replace all of the indexes on the rest of the pgbench tables with hash indexes, too, just for additional testing.
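
A minimal sketch of the schema change suggested above, written as a small Python helper that prints the DDL. It assumes the standard pgbench tables and their default primary key constraint names (pgbench_accounts_pkey and so on); the hash index names and the function name are hypothetical, chosen only for illustration.

```python
# Assumed pgbench tables and their key columns (standard pgbench schema).
PGBENCH_KEYS = {
    "pgbench_accounts": "aid",
    "pgbench_branches": "bid",
    "pgbench_tellers": "tid",
}


def hash_index_ddl(keys=PGBENCH_KEYS):
    """Return DDL that swaps each table's btree primary key for a hash index."""
    stmts = []
    for table, col in keys.items():
        # Dropping the constraint also drops the btree index that backs it.
        stmts.append(
            f"ALTER TABLE {table} DROP CONSTRAINT IF EXISTS {table}_pkey;"
        )
        # Hypothetical index name; a hash index cannot enforce uniqueness.
        stmts.append(
            f"CREATE INDEX {table}_{col}_hash ON {table} USING hash ({col});"
        )
    return stmts


if __name__ == "__main__":
    print("\n".join(hash_index_ddl()))
```

The output can be piped into psql against the test database after pgbench initialisation. With this schema the built-in pgbench lookups by aid, bid and tid have only hash indexes available, so searches exercise them as well as maintenance.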
I plan to do testing using my own testing harness after changing it to insert a lot of dummy tuples (ones with negative values in the pseudo-pk column, which are never queried by the core part of the harness) and deleting them at random intervals. I think that none of pgbench's built in tests are likely to give the bucket splitting and squeezing code very much exercise.
Is there a way to gather statistics on how many of each type of WAL record are actually getting sent over the replication link? The only way I can think of is to turn on WAL archiving as well as replication, then use pg_xlogdump to gather the stats.
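
The archiving approach described above could look roughly like the sketch below. It relies on pg_xlogdump's --stats=record and --path options; the archive directory, the segment names and the helper names are hypothetical. The archived WAL is what the master generated, so the counts only approximate what travels over the replication link.

```python
import shlex

# Assumed archive settings for postgresql.conf; the archive path is hypothetical.
ARCHIVE_SETTINGS = {
    "archive_mode": "on",
    "archive_command": "cp %p /mnt/wal_archive/%f",
}


def conf_lines(settings=ARCHIVE_SETTINGS):
    """Render the settings as postgresql.conf lines."""
    return [f"{name} = '{value}'" for name, value in settings.items()]


def xlogdump_stats_cmd(archive_dir, first_seg, last_seg):
    """Build a pg_xlogdump call that reports statistics per record type."""
    return [
        "pg_xlogdump",
        "--stats=record",       # per-record-type counts and sizes
        "--path", archive_dir,  # directory holding the archived segments
        first_seg,
        last_seg,
    ]


if __name__ == "__main__":
    print("\n".join(conf_lines()))
    # Example segment names only; substitute the range covered by the test run.
    cmd = xlogdump_stats_cmd(
        "/mnt/wal_archive",
        "000000010000000000000001",
        "000000010000000000000010",
    )
    print(" ".join(shlex.quote(arg) for arg in cmd))
```

Running the printed command after a test run should list the hash index record types alongside the heap and btree ones, which shows whether split and squeeze records were generated at all.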
I've run my original test for a while now and have not seen any problems. But I realized I forgot to compile with --enable-cassert, so I will have to redo it to make sure the assertion failures have been fixed. In my original testing I did very rarely get a deadlock (or some kind of hang), and I haven't seen that again so far. It was probably the same source as the one Mark observed, and so the same fix.
Cheers,

Jeff

Yeah, good suggestion about replacing (essentially) all the indexes with hash ones and testing. I did some short runs with this type of schema yesterday (actually to get a feel for whether hash performance vs btree was comparable, and it does seem to be), but longer ones with higher concurrency (as high as I can manage on a single socket i7 anyway) are probably a good plan. If Ashutosh has access to seriously large numbers of cores then that is even better :-)


Cheers

Mark


