I guess it is design. I think some operations are faster, if you count
bytes instead of characters. There could be an option to allow mawk
count characters, though.

** Description changed:

  $ echo ä | mawk '{print length($0)}'
- outputs 2. I expect 1. 
+ outputs 2. I expect 1.
  
  $ echo äo | mawk '{print match($0,"o")}'
  outputs 3. I expect 2.
  
  Probably this is due to the internal representation of UTF-8 characters;
- mawk counts bytes instead of characters.
+ mawk counts bytes instead of characters. gawk works similarly, if -b
+ option is used.
  
  ProblemType: Bug
  DistroRelease: Ubuntu 14.04
  Package: mawk 1.3.3-17ubuntu2
  ProcVersionSignature: Ubuntu 3.13.0-53.89-lowlatency 3.13.11-ckt19
  Uname: Linux 3.13.0-53-lowlatency x86_64
  ApportVersion: 2.14.1-0ubuntu3.11
  Architecture: amd64
  CurrentDesktop: XFCE
  Date: Sun Jun  7 15:52:26 2015
  Dependencies:
-  gcc-4.9-base 4.9.1-0ubuntu1
-  libc6 2.19-0ubuntu6.6
-  libgcc1 1:4.9.1-0ubuntu1
-  multiarch-support 2.19-0ubuntu6.6
+  gcc-4.9-base 4.9.1-0ubuntu1
+  libc6 2.19-0ubuntu6.6
+  libgcc1 1:4.9.1-0ubuntu1
+  multiarch-support 2.19-0ubuntu6.6
  EcryptfsInUse: Yes
  InstallationDate: Installed on 2014-09-21 (259 days ago)
  InstallationMedia: Ubuntu-Studio 14.04.1 LTS "Trusty Tahr" - Release amd64 
(20140722.1)
  SourcePackage: mawk
  UpgradeStatus: No upgrade log present (probably fresh install)

-- 
You received this bug notification because you are a member of Ubuntu
Touch seeded packages, which is subscribed to mawk in Ubuntu.
https://bugs.launchpad.net/bugs/1462737

Title:
  mawk counts characters incorrectly

Status in mawk package in Ubuntu:
  New

Bug description:
  $ echo ä | mawk '{print length($0)}'
  outputs 2. I expect 1.

  $ echo äo | mawk '{print match($0,"o")}'
  outputs 3. I expect 2.

  Probably this is due to the internal representation of UTF-8
  characters; mawk counts bytes instead of characters. gawk works
  similarly, if -b option is used.

  ProblemType: Bug
  DistroRelease: Ubuntu 14.04
  Package: mawk 1.3.3-17ubuntu2
  ProcVersionSignature: Ubuntu 3.13.0-53.89-lowlatency 3.13.11-ckt19
  Uname: Linux 3.13.0-53-lowlatency x86_64
  ApportVersion: 2.14.1-0ubuntu3.11
  Architecture: amd64
  CurrentDesktop: XFCE
  Date: Sun Jun  7 15:52:26 2015
  Dependencies:
   gcc-4.9-base 4.9.1-0ubuntu1
   libc6 2.19-0ubuntu6.6
   libgcc1 1:4.9.1-0ubuntu1
   multiarch-support 2.19-0ubuntu6.6
  EcryptfsInUse: Yes
  InstallationDate: Installed on 2014-09-21 (259 days ago)
  InstallationMedia: Ubuntu-Studio 14.04.1 LTS "Trusty Tahr" - Release amd64 
(20140722.1)
  SourcePackage: mawk
  UpgradeStatus: No upgrade log present (probably fresh install)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/mawk/+bug/1462737/+subscriptions

-- 
Mailing list: https://launchpad.net/~touch-packages
Post to     : touch-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~touch-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to