Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Boundary v0.14 - error fetching connection to send session teardown request #4975

Open
rfc2119 opened this issue Jul 29, 2024 · 0 comments
Open
Labels
bug Something isn't working

Comments

@rfc2119
Copy link

rfc2119 commented Jul 29, 2024

Describe the bug
At random points in time, all clients (Desktop and CLI) already connected to the Kubernetes worker are unable to connect. After 5 minutes or so, the connection comes back and clients are able to connect to the requested targets. This happens for all targets and clients.

On the Desktop client, I can't seem to find any relevant logs. On the CLI client, here are the logs:

error fetching connection to send session teardown request to worker: Error dialing the worker: failed to WebSocket dial: failed to send handshake request: Get "http://public-k8s-worker:9202/v1/proxy": context deadline exceeded

This keeps repeating until the connection/session comes back online. Polling /worker-info on the Kubernetes worker yields "READY" for GRPC upstream connection state:

{
  "worker_process_info": {
    "state": "active",
    "active_session_count": 14,
    "session_connections": {
      "s_9MntRSIZK6": 15,
      "s_AJCsJ4ELib": 16,
      "s_F0N0pBIorY": 11,
      "s_U8d9dLbKlx": 3,
      "s_Xga4gGHkFK": 11,
      "s_Zgtv1w90w6": 47,
      "s_afFHlpg6H7": 16,
      "s_ap8gJvRmlv": 8,
      "s_c0CL0p1SPO": 8,
      "s_dae6ijxKrP": 25,
      "s_geWN5MEe9k": 10,
      "s_r0L1lfVe54": 13,
      "s_re1VmSCNjf": 2,
      "s_z2BOSqwAs6": 2
    },
    "upstream_connection_state": "READY"
  }
}

To Reproduce
This issue happens randomly with no known interval. The frequency of the issue is not uniform.

Expected behavior
All sessions opened should not be interrupted

Additional context
Worker version: v0.15.4
Controller version: v0.14.5
CLI version: 0.14.3
3 controllers in HA setup and one Kubernetes worker

@rfc2119 rfc2119 added the bug Something isn't working label Jul 29, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant