Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jaeger Tracing Looks Problematic #342

Open
Orfeasfil opened this issue Jun 7, 2024 · 1 comment
Open

Jaeger Tracing Looks Problematic #342

Orfeasfil opened this issue Jun 7, 2024 · 1 comment

Comments

@Orfeasfil
Copy link

Hi there!

I deployed DeathStarBench on a Kubernetes cluster using helm. We initialized the social network graph and used the wrk command to run the following services:

  1. read-home-timeline
  2. compose-post-service
  3. read-user-timeline

After running these services, we attempted to retrieve the traces via the Jaeger UI. The results seem problematic. For each service, a random request yields the following results:

read-home-timeline:
command: /wrk2/wrk -D exp -t 100 -c 1000 -d 5 -L -s ./wrk2/scripts/social-network/read-home-timeline.lua http://${NGINX-THRIFT-SERVICE-IP}:8080/wrk2-api/home-timeline/read -R 100
results:
image

  • Total latency of the request: 394.71ms
  • Total latency of the services: 3.12ms

The latencies shown do not add up correctly, which seems erroneous.

compose-post-service:
command: /wrk2/wrk -D exp -t 10 -c 1000 -d 5 -L -s ./wrk2/scripts/social-network/compose-post.lua http://${NGINX-THRIFT-SERVICE-IP}:8080/wrk2-api/post/compose -R 100
results:
image

read-user-timeline :
command: /wrk2/wrk -D exp -t 100 -c 1000 -d 5 -L -s ./wrk2/scripts/social-network/read-user-timeline.lua http://${NGINX-THRIFT-SERVICE-IP}:8080/wrk2-api/user-timeline/read -R 100
results:
image

Troubleshooting Attempts:
We experimented with different sampleParam values for Jaeger (0.01, 0.1, and 1), but the results remained consistent with the same discrepancies.

Steps to Reproduce:

  1. Create a Kubernetes cluster.
  2. Follow the instructions to initialize the social graph.
  3. Run the read-home-timeline, compose-post-service, and read-user-timeline services.
  4. Retrieve the traces via the Jaeger UI.

Do you have any idea on whats happening? Any suggestions or insights would be greatly appreciated.

@ccs1110
Copy link

ccs1110 commented Nov 22, 2024

it looks like that you delployed the microservices in different devices,and the transfer latency is bigger than service latency

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants