Push to talk does not work in Wayland / Gnome3 #3243

itsrachelfish · 2017-10-04T01:58:47Z

I recently upgraded my OS to use Gnome 3.24.2 which now uses Wayland instead of X. Gnome has used Wayland by default since version 3.16, which was released in 2015.

Push to talk works as expected when Mumble has focus and also works when certain applications like Firefox or my text editor are in focus.

However, push to talk does not work when a Wayland native application is in focus, like gnome-terminal or gnome-files.

If I start pressing push to talk with Mumble focused and switch tabs to a Wayland native application, Mumble does not detect when I stop pressing the push to talk key and it stays open until I switch back.

I've tried using both keyboard and mouse hotkeys, but the behavior is the same.

I couldn't find any other issues related to this bug on GitHub, but I did find this similar bug report in the Red Hat bug tracker: https://bugzilla.redhat.com/show_bug.cgi?id=1417576

Any further information about what could be causing this bug would be appreciated, thank you!

sardemff7 · 2017-10-05T17:29:37Z

Hi,

Not allowing clients to sniff on the input when they don’t have focus is a feature in Wayland, and it will be kept that way. The use can then trust their input to go where it is expected.
Since X11 clients are actually sharing one Wayland connection, through the Xwayland server, it actually means that focusing an X11 client will give input to all of them. That is what you are experiencing with some clients allowing PTT to work.

There are four cases of global bindings that I am aware of:

WM bindings to manage windows
xbindkeys daemon and friends, with two main usages that I know about:
- launching stuff
- translating from one binding to another (like, a mouse button to a key, allowing to use mouse buttons with applications with keyboard-only bindings)
Media players
Push-to-talk

Under Wayland, the first case will work as usual, since the compositor has control over the input. The second case would ideally be split between the first one (for launching stuff) and a non-problem, see below. Also, the average user doesn’t need such tweaks in the first place.
The Media player case is mostly solved in DEs by having the compositor handling media keys and sending e.g. MPRIS commands.
We’re left with the fourth case (and half of the second one) that DEs had, until now, little interest in.

So the ideal protocol would:

not leak keyboard information
allow non-keyboard bindings

To have both, I went with an action-based protocol. I sent a proposal a few years ago and more recently, I made a cleaner one to allow for global action bindings.
To work as expected, clients (or toolkits) wanting to support global bindings would have to implement it. On the other side, compositors would have to implement the protocol and provide their user a way to link (key, mouse, touch, mind-control) bindings to said actions.

The action are namespaced, and you are expected to use fallbacks. For Mumble, it means you would ask for the "mumble/push-to-talk voip/push-to-talk" action. The user could then have a Mumble-specific binding (or not), and a generic binding. Let’s say Teamspeak is running too, pressing the key for voip/push-to-talk would lead to the action event being sent to either Mumble or Teamspeak (for example, based on the last focused one).

(I should probably write all that to the mailing list, for the record if nothing else.)

If there is any interest from Mumble developers for this solution, I am willing to implement the compositor side for Weston (and all libweston-based compositors) as well as WLC-based compositors (at least via an LD_PRELOAD hack), and I may convince wlroots developers too (to be used by Sway and way-cooler, two important tiling compositors).

mkrautz · 2017-11-26T15:24:03Z

Sorry for taking so long to respond.

@itsrachelfish Mumble currently defaults to using XInput2 to "sniff" keypresses and mouse events. However, it can also use raw evdev.

Usually, the default device node permissions OSes allow you to read mouse clicks, but keyboards are off-limits (obviously).

However, if you configure the device node permissions, Mumble can happily use your raw evdev device nodes to read key events, which should work on Wayland, or everything, really.

The setting is "shortcut/linux/evdev/enable". To configure it on Linux, you'd add

[shortcut/linux/evdev]
enable=true

to $HOME/.config/Mumble/Mumble.conf.

However, there is a bit of a misbehavior right now, where Mumble will fall back to XInput2 when no keyboards can be opened via evdev. This behavior is from back when evdev was our default.
Now, it makes more sense for Mumble to keep using evdev if it is enabled, and warn the user if no keyboard device nodes are available.

That's the workaround until we figure something proper out for Wayland.

mkrautz · 2017-11-26T15:27:45Z

Bug for evdev misbehavior is at #3269

mkrautz · 2017-11-26T15:36:10Z

@sardemff7 I think your proposal (the August-dated one) looks solid. Is there an implementation of this stuff anywhere, or is it just a spec for now?

mkrautz · 2017-11-26T15:51:43Z

@sardemff7:

...But I'm not sure it's enough in its current incarnation. At least it doesn't map fully onto the way global shortcuts work in Mumble currently. (And obviously: it doesn't need to. We're willing to use different UI if we need to, for different platforms.)

It seems like, if we were to use the current API, Mumble would simply bind to "mumble/push-to-talk", "mumble/volume-up", "mumble/volume-down", etc. -- and we wouldn't be able to show the actual bound key to the user, because that part is handled by the compositor. That means the UI for shortcuts would be less than ideal for users.

Perhaps we need a way to query which keys/events are bound to an action, so we can show that to the user?

How would the flow work from a user perspective? Do you configure the actions outside the app itself?
Otherwise, how do you map a keypress/event to an action using the exposed API?

Kind of ties into my previous comment, but I suppose the current API requires us to bind to the actions on startup, correct? If we don't, we won't receive notifications when the action is triggered?

mkrautz · 2017-11-26T16:02:24Z

Usually, the default device node permissions OSes allow you to read mouse clicks, but keyboards are off-limits (obviously).

Hmmm. Actually, on my Ubuntu 16.10 test VM, /dev/input/mice (or mouseXX) are also only root readable (or rw by group 'input').

sardemff7 · 2017-11-26T18:46:23Z

Not allowing sniffing on evdev directly is also a goal (that most OSes now do right because device nodes are root, as you noticed).

For now, there is no code behind my proposal, because nobody actually had (code-backed) interest in it. If Mumble is willing to implement it, I can make a Weston implementation, but I think at least a GNOME or KDE implementation would be needed to really push that protocol forward.

The client (Mumble) binds actions at startup, as you guessed.

As for the UI/UX, it would be compositor-dependent. Each DE/compositor would have its own UI (for Weston, it would just be the configuration file, at first, but writing a GUI tool is not really hard to do either). I can imagine GNOME and KDE having a new thing in their control panel, with a list of action strings and the corresponding binding(s).
The client could either have no UI at all (or a message “see your compositor configuration”) or we add a request in the protocol to make the compositor pop the UI directly, even filtered to the action bound by the client.

Mumble would bind actions in the mumble/ and voip/ namespaces, and receive events for the relevant one. Say you have Mumble and Teamspeak running at the same time, the compositor would decide which one to send the events to (for example, last focused).

detrout · 2018-11-07T06:42:08Z

I was impacted by this and came up with a possibly solution.

I extended mumble's dbus api to include startTalk znd stopTalk calls.

Then I wrote a small program that looked for the mouse button event I was using for push to talk that I could run with root permissions that could then send startTalk on mouse button down and stopTalk on mouse button up.

In the long term the desktops need to define some wayland accessibility system where users can bind global hot keys, and having such a thing send dbus messages seems reasonable.

But for the moment my little hard coded program will get me through my next gaming session.

Mumble patch

--- a/src/mumble/DBus.cpp
+++ b/src/mumble/DBus.cpp
@@ -101,3 +101,11 @@
 bool MumbleDBus::isSelfDeaf() {
 	return g.s.bDeaf;
 }
+
+void MumbleDBus::startTalk() {
+	g.mw->on_PushToTalk_triggered(true, 0);
+}
+
+void MumbleDBus::stopTalk() {
+	g.mw->on_PushToTalk_triggered(false, 0);
+}
--- a/src/mumble/DBus.h
+++ b/src/mumble/DBus.h
@@ -52,6 +52,8 @@
 		void setSelfDeaf(bool deafen);
 		bool isSelfMuted();
 		bool isSelfDeaf();
+                void startTalk();
+                void stopTalk();
 };
 
 #endif

Hackish program.
You'd need to change the keys its looking for, set the right device, set the user id, and set the do a export DBUS_SESSION_BUS_ADDRESS in the root session before it'd work for someone else.

#include <linux/input.h>
#include <linux/input-event-codes.h>
#include <unistd.h>

#include <QtCore/QCoreApplication>
#include <QtDBus/QtDBus>

#include <stdio.h>

#define SERVICE_NAME "net.sourceforge.mumble.mumble"

int main(int argc, char **argv) {
  QCoreApplication app(argc, argv);  
  FILE *mice = NULL;
  struct input_event e;
  int read;

  mice = fopen("/dev/input/by-id/usb-Logitech_USB_Receiver-if01-event-mouse", "rb");
  if (mice == NULL) {
    printf("unable to open mice device %d\n", errno);
    return -1;
  }
  
  fprintf(stderr, "uid %d, euid %d\n", getuid(), geteuid());
  setuid(1000);
  fprintf(stderr, "uid %d, euid %d\n", getuid(), geteuid());

  if (!QDBusConnection::sessionBus().isConnected()) {
    fprintf(stderr, "Cannot connect to the D-Bus session bus.\n");
    return -2;
  }

  QDBusInterface mumble(SERVICE_NAME, "/", "", QDBusConnection::sessionBus());
  if (!mumble.isValid()) {
    fprintf(stderr, "Failed to connect to %s\n", SERVICE_NAME);
    return -1;
  }
    
  while (1) {
    read = fread(&e, sizeof(struct input_event), 1, mice);
    if (read == 1) {
      if (e.type == EV_KEY && e.code == BTN_EXTRA) {
          QDBusReply<void> reply;
          if (e.value) {
            // mouse down
            reply = mumble.call("startTalk");
          } else {
            // mouse up
            reply = mumble.call("stopTalk");
          }
      }
    }
  }
}

zevdg · 2019-05-24T14:27:01Z

Let’s say Teamspeak is running too, pressing the key for voip/push-to-talk would lead to the action event being sent to either Mumble or Teamspeak (for example, based on the last focused one).

It seems weird to disallow broadcast (or multicast) in the spec. I can imagine examples where the user would want to broadcast voip/push-to-talk to multiple apps. Imagine a gamer with their friends on mumble, but also other randomly matched teammates using the voip built into the game. They'd have one key bound to mumble/push-to-talk for their friends, but when they want to talk to their whole team, they want one button to activate push-to-talk in both apps.

Of course, usually in this case, the game would be in focus and could get the key presses that way, but that seems like a weird and unnecessary limitation. As that user, I would expect my global push-to-talk key to keep working even if I alt-tab out to my browser for a second.

Could we leave it entirely up to the compositor to decide where to send the actions?

sardemff7 · 2019-05-24T14:41:17Z

From my proposed protocol:

Here are some examples of dispatching choice: all applications, last
focused, user-defined preference order, latest fullscreened application.

It is up to the compositor. With that protocol, anyway.
But there are still no implementations that I know about and the protocol was never accepted anywhere.

zevdg · 2019-05-24T17:03:59Z

Doh. I should have looked at the actual proposal instead of only reading this issue.

setpill · 2019-07-25T12:05:27Z

Would it be an idea to allow push-to-talk to be triggered through the RPC (to both start and stop)? That way, under wayland, one can simply configure their desktop environment to run mumble rpc speak-start on keydown and mumble rpc speak-stop on keyup (or whatever naming is decided).

Edit: upon closer inspection, the dbus PR already covers my usecase.

lheckemann · 2020-03-23T08:58:57Z

@detrout's patch + following sway config snippet (with swaywm/sway#5132) is working nicely for me:

    bindsym --no-repeat F12 exec gdbus call -e -d net.sourceforge.mumble.mumble -o / -m net.sourceforge.mumble.Mumble.startTalk
    bindsym --release F12 exec gdbus call -e -d net.sourceforge.mumble.mumble -o / -m net.sourceforge.mumble.Mumble.stopTalk

Would there be any problems with merging the patch?

ghost · 2020-03-23T09:22:06Z

It's already merged, but after 1.3.0 release, so we need to wait for the next release of Mumble.

Krzmbrzl · 2020-03-23T10:08:02Z

It'll be part of the 1.4 release. Since it is not a hotfix, it won't be in 1.3.1.

Krzmbrzl · 2020-03-23T10:09:23Z

Since #3675 has been merged into master already, I'll also close this issue as resolved.

lheckemann · 2020-03-23T14:03:35Z

Oh, I see. Thanks! I've opened swaywm/sway#5132 to allow using this without constantly reinvoking the startTalk call while the key is pressed.

carlwgeorge · 2020-08-18T22:48:31Z

If any Fedora users want to try out the dbus calls from #3675, I have created a copr repository with that pull request patched into version 1.3.2.

https://copr.fedorainfracloud.org/coprs/carlwgeorge/mumble-wayland/

You'll still have to configure the dbus calls as shortcuts in your desktop environment.

gimco · 2020-09-25T07:22:22Z

I was annoyed with this, but in my case, after activating the topicons extension (to get legacy trayicons on top bar) the keyboard shortcut started to be received by mumble! So, it's a workaround but it maybe helpful meantime

JuniorJPDJ · 2021-07-17T15:48:34Z

bump

Krzmbrzl · 2021-07-18T06:30:51Z

@JuniorJPDJ To my knowledge Wayland does not provide support for registering global shortcuts but inhibits keyboard event polling (what we are currently doing) when the application doesn't have focus.

That means we'd have to write specific code for every (major) DE out there in order to hook into their way of doing this (if there is one).
That's just not really feasible since afaik there is no standard for this across DEs.

Thus as far as long as this situation doesn't change, I don't see us going down that route. Imo this is just a major missing feature of Wayland.

davidebeatrici · 2021-07-18T17:46:03Z

We have to prioritize Evdev over X11.

vchernin · 2021-10-13T16:36:34Z

The best solution is probably: flatpak/xdg-desktop-portal#624

This would ideally be a universal way of registering global keyboard shortcuts. You wouldn't need to do compositor-specific tweaks.

Rush · 2022-09-29T17:47:55Z

I published a small generic workaround for Wayland push to talk that listens on specific key events on evdev and pushes those events to Xwayland. https://github.com/Rush/wayland-push-to-talk-fix

I think it can be easily adjusted to send those to dbus instead.

mkrautz mentioned this issue Nov 26, 2017

GlobalShortcut_unix: evdev mode should not fall back to XInput2 when no keyboard device nodes area available #3269

Open

davidebeatrici added linux client priority/P2 - Important labels Dec 29, 2017

robozman mentioned this issue Apr 24, 2019

Added DBus calls to activate and deactivate push to talk #3675

Merged

Krzmbrzl closed this as completed Mar 23, 2020

ghost mentioned this issue Apr 17, 2020

Push-to-Talk key hangs on Wayland #4073

Closed

Krzmbrzl mentioned this issue May 18, 2020

Cannot assign keybinds #3816

Closed

Krzmbrzl mentioned this issue Apr 5, 2021

Cannot assign shortcut key press to mumble as standard user Fedora 34 - Gnome 40 - Wayland #4907

Closed

Krzmbrzl mentioned this issue Sep 7, 2021

Keyboard shortcuts don't work on Wayland #5257

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Push to talk does not work in Wayland / Gnome3 #3243

Push to talk does not work in Wayland / Gnome3 #3243

itsrachelfish commented Oct 4, 2017

sardemff7 commented Oct 5, 2017

mkrautz commented Nov 26, 2017

mkrautz commented Nov 26, 2017

mkrautz commented Nov 26, 2017

mkrautz commented Nov 26, 2017

mkrautz commented Nov 26, 2017

sardemff7 commented Nov 26, 2017

detrout commented Nov 7, 2018

zevdg commented May 24, 2019 •

edited

Loading

sardemff7 commented May 24, 2019

zevdg commented May 24, 2019

setpill commented Jul 25, 2019 •

edited

Loading

lheckemann commented Mar 23, 2020 •

edited

Loading

ghost commented Mar 23, 2020

Krzmbrzl commented Mar 23, 2020

Krzmbrzl commented Mar 23, 2020

lheckemann commented Mar 23, 2020

carlwgeorge commented Aug 18, 2020

gimco commented Sep 25, 2020

JuniorJPDJ commented Jul 17, 2021

Krzmbrzl commented Jul 18, 2021

davidebeatrici commented Jul 18, 2021

vchernin commented Oct 13, 2021

Rush commented Sep 29, 2022

Push to talk does not work in Wayland / Gnome3 #3243

Push to talk does not work in Wayland / Gnome3 #3243

Comments

itsrachelfish commented Oct 4, 2017

sardemff7 commented Oct 5, 2017

mkrautz commented Nov 26, 2017

mkrautz commented Nov 26, 2017

mkrautz commented Nov 26, 2017

mkrautz commented Nov 26, 2017

mkrautz commented Nov 26, 2017

sardemff7 commented Nov 26, 2017

detrout commented Nov 7, 2018

zevdg commented May 24, 2019 • edited Loading

sardemff7 commented May 24, 2019

zevdg commented May 24, 2019

setpill commented Jul 25, 2019 • edited Loading

lheckemann commented Mar 23, 2020 • edited Loading

ghost commented Mar 23, 2020

Krzmbrzl commented Mar 23, 2020

Krzmbrzl commented Mar 23, 2020

lheckemann commented Mar 23, 2020

carlwgeorge commented Aug 18, 2020

gimco commented Sep 25, 2020

JuniorJPDJ commented Jul 17, 2021

Krzmbrzl commented Jul 18, 2021

davidebeatrici commented Jul 18, 2021

vchernin commented Oct 13, 2021

Rush commented Sep 29, 2022

zevdg commented May 24, 2019 •

edited

Loading

setpill commented Jul 25, 2019 •

edited

Loading

lheckemann commented Mar 23, 2020 •

edited

Loading