-
Notifications
You must be signed in to change notification settings - Fork 74
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Networking Issues: Zeppelin cannot fetch dependencies #3
Comments
I think the Kafka servers may have gotten in a twist due to excessively bringing them up/down. Deleting the kafka/kafka_zookeeper Docker volumes seems to have fixed the
Note this is using my code above to add the JAR to the I've seen some other people with what looks like the same problem (see https://stackoverflow.com/questions/58975545/azure-databricks-kafkautils-createdirectstream-causes-py4jnetworkerroranswer) and they've suggested upgrading the JAR. I've tried editing the Zeppelin interpreters to use the newer "spark-streaming-kafka-0-8_2.11-2.4.5.jar" but this results in the same set of issues. |
I am facing with the same issues. Have you resolved it @gregbrowndev yet? |
Using the assembly solved the issue:
|
@dotdothu sorry for the late reply! Glad you managed to fix it. I added the Jar manually, just like you did, but never quite got it completely working. I still had issues with the consumer: "Py4JError: An error occurred while calling o66.createDirectStreamWithoutMessageHandler" |
@gregbrowndev I added your library files to spark dep. below is my entire operation %sh and %producer.dep %consumer.dep |
Hi, and thank you for this repo it is incredibly useful!
However, I'm having problems running your notebook. I'm not really sure what exactly the problem is (so sorry from the ramblings below) as I've tried a number of different things and getting different issues even after reverting all changes back to your exact code.
Initially, upon executing the notebook, everything apart from the consumer seemed to work. I could see messages being written to the Kafka topic in Grafana. However, the consumer complained that it didn't have the Kafka dependency available.
I found that the Kafka JAR hadn't been downloaded, as indicated in the image below of Zeppelin's interpreter page:
I couldn't even wget the JAR from inside the Zeppelin container (I believe the request timedout). However, now (after reverting my changes) it seems I can wget in the container, but Zeppelin is still unable to fetch them itself.
Putting aside the network issue, I've tried to manually add the dependency using:
but now the producer times out when connecting to Kafka:
This is all on a fresh set of containers (
docker-compose down && docker-compose up -d
).Any help would be greatly appreciated!
The text was updated successfully, but these errors were encountered: