Kubernetes Nat Manager do not print exception message (maybe is throwing a exception without message) #3787
Comments
Hi @Vjtom We've got a repo dedicated to k8s available here https://github.com/ConsenSys/quorum-kubernetes as well, the repo has been updated recently and the docs are in the process of being updated hyperledger/besu-docs#1030 Cheers |
Thank you @joshuafernandes for the reply. First, I tried the service exactly as in this doc: https://besu.hyperledger.org/en/stable/Tutorials/Kubernetes/Nat-Manager-Kubernetes/ Here is the manifest:
And a deployment I have tried like this:
I have replaced the value of besu_miner_coinbase with ... here, but the real deployment has an actual besu_miner_coinbase in this variable. |
Hi @Vjtom I think some of the selector labels are a bit off? I've done the following without issue:
which gave me the logs below:
|
Thank you so much for the effort to reproduce! Looking at the code, it has an exception in this part:
I saw that it calls e.getMessage(). That's why I created this issue, because it seems that the exception raised had no message to display. And now I don't know what is going on under the hood. |
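To illustrate the point about `e.getMessage()`: an exception's message can legitimately be null or empty, and a log line built only from that message then shows nothing after "reason :". Below is a minimal Java sketch of a fallback. `ExceptionReason` and `describe` are hypothetical names used for illustration only, not Besu code, and the log wording is copied from the output in this thread.

```java
final class ExceptionReason {
  // Upper bound on how far the cause chain is followed (assumed value).
  private static final int MAX_DEPTH = 10;

  private ExceptionReason() {}

  /**
   * Returns the first non-blank message found in the cause chain, prefixed by
   * the class names of any message-less exceptions that wrapped it. If no
   * exception in the chain carries a message, the class name of the outermost
   * exception is returned so the log line is never empty.
   */
  static String describe(final Throwable error) {
    final StringBuilder chain = new StringBuilder();
    Throwable current = error;
    for (int depth = 0; current != null && depth < MAX_DEPTH; depth++) {
      final String message = current.getMessage();
      if (message != null && !message.trim().isEmpty()) {
        return chain + message.trim();
      }
      chain.append(current.getClass().getName()).append(" caused by ");
      current = current.getCause();
    }
    return error.getClass().getName() + " (no message)";
  }

  public static void main(final String[] args) {
    final Exception silent = new RuntimeException("");
    // Same wording as the debug line in this issue, but with a fallback reason.
    System.out.printf(
        "Nat manager failed to configure itself automatically due to the following reason : %s. NONE mode will be used%n",
        describe(silent));
  }
}
```

With this sketch, an exception created with an empty message is reported as `java.lang.RuntimeException (no message)` instead of an empty string, which at least tells the reader which exception type to look for.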
Hi @Vjtom. Here's the logs at debug level.
@matkt does the above look right mate? |
@Vjtom what happens when you manually set the NAT manager mode? https://besu.hyperledger.org/en/latest/Tutorials/Kubernetes/Nat-Manager-Kubernetes/#automatic-detection-errors |
Look, your log is like mine; it also printed that part. What do you mean by setting it manually? |
Set the cli options/env manually as per https://besu.hyperledger.org/en/latest/HowTo/Find-and-Connect/Specifying-NAT/ and see what you get. |
I have already done this! The same happens in AUTO or KUBERNETES mode. |
👍 @matkt could you have a look please - am not sure why this is happening |
looking at the log it seems it comes from this line https://github.com/hyperledger/besu/blame/ed1329cf848a9d7858d039a55add810da3c428b5/nat/src/main/java/org/hyperledger/besu/nat/kubernetes/KubernetesNatManager.java#L81 |
Hi @matkt not been able to mate. It seems that the message re failing is hidden in debug - might be worth putting that in info too? Our charts in quorum-kubernetes use CNI for networking so we bypass this so to speak. The most recent attempt I did of this was above here where it fails silently. I'll try an older version and see how far I get mate |
OK, thanks. because I have serious doubts about the last modification of 4 months ago which precisely affects the library |
Hi @matkt seems like the exception piece broke at 21.10.7. I ran the above files on minikube: up to 21.10.6 I get a reason, from 21.10.7 it's a null value.
2022-05-17 21:45:13.631+00:00 | OkHttp ConnectionPool | DEBUG | TaskRunner | Q10000 run again after 300 s : OkHttp ConnectionPool
2022-05-17 21:45:13.631+00:00 | main | DEBUG | NatService | Nat manager failed to configure itself automatically due to the following reason : . NONE mode will be used
2022-05-17 21:45:13.631+00:00 | OkHttp TaskRunner | DEBUG | TaskRunner | Q10000 finished run in 619 µs: OkHttp ConnectionPool
21.10.6
2022-05-17 21:53:50.534+00:00 | main | INFO | KubernetesNatManager | Starting kubernetes NAT manager.
2022-05-17 21:53:50.548+00:00 | main | DEBUG | KubernetesNatManager | Trying to update information using Kubernetes client SDK.
2022-05-17 21:53:50.860+00:00 | main | DEBUG | NatService | Nat manager failed to configure itself automatically due to the following reason : Forbidden. NONE mode will be used
2022-05-17 21:53:50.860+00:00 | main | INFO | NetworkRunner | Starting Network.
2022-05-17 21:53:50.879+00:00 | main | DEBUG | DefaultChannelId | -Dio.netty.processId: 1 (auto-detected)
21.10.4
2022-05-17 21:52:31.502+00:00 | main | INFO | KubernetesNatManager | Starting kubernetes NAT manager.
2022-05-17 21:52:31.516+00:00 | main | DEBUG | KubernetesNatManager | Trying to update information using Kubernetes client SDK.
2022-05-17 21:52:31.825+00:00 | main | DEBUG | NatService | Nat manager failed to configure itself automatically due to the following reason : Forbidden. NONE mode will be used
2022-05-17 21:52:31.825+00:00 | main | INFO | NetworkRunner | Starting Network.
21.10.1
2022-05-17 21:48:15.643+00:00 | main | DEBUG | KubernetesNatManager | Trying to update information using Kubernetes client SDK.
2022-05-17 21:48:15.928+00:00 | main | DEBUG | NatService | Nat manager failed to configure itself automatically due to the following reason : Forbidden. NONE mode will be used
2022-05-17 21:48:15.928+00:00 | main | INFO | NetworkRunner | Starting Network.
21.7.4
SLF4J: Actual binding is of type [org.apache.logging.slf4j.Log4jLoggerFactory]
2022-05-17 21:50:54.590+00:00 | main | DEBUG | KubernetesNatManager | Trying to update information using Kubernetes client SDK.
2022-05-17 21:50:54.853+00:00 | main | DEBUG | NatService | Nat manager failed to configure itself automatically due to the following reason : Forbidden. NONE mode will be used
2022-05-17 21:50:54.854+00:00 | main | INFO | NetworkRunner | Starting Network. |
Hum, ok so it's just an exception issue. I will check why we have this kind of modification on this part |
hyperledger#4023) hyperledger#3787 Signed-off-by: Antony Denyer <git@antonydenyer.co.uk>
Description
As a developer, I want to know what the issue is when starting nodes with the Kubernetes NAT manager, so that I can understand the exception that was raised.
Acceptance Criteria
When running Besu on Kubernetes, the message tells me something useful.
Steps to Reproduce (Bug)
But in my case the message is:
2022-05-04 19:51:57.105+00:00 | main | DEBUG | NatService | Nat manager failed to configure itself automatically due to the following reason : . NONE mode will be used
2022-05-04 19:51:57.106+00:00 | main | INFO | NetworkRunner | Starting Network.
There is no exception message after "the following reason :".
Expected behavior:
When the exception has no message, I expect at least its stack trace to be logged. This message is displayed at debug level, so maybe printing the stack trace there is OK?
Actual behavior: The message is printed without the exception message, or the exception raised doesn't have a message.
2022-05-04 19:51:57.105+00:00 | main | DEBUG | NatService | Nat manager failed to configure itself automatically due to the following reason : . NONE mode will be used
2022-05-04 19:51:57.106+00:00 | main | INFO | NetworkRunner | Starting Network.
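The expected behavior above (fall back to the stack trace when the exception has no message) could look roughly like the sketch below. It uses only the JDK. `StackTraceText`, `of` and `reasonOrTrace` are hypothetical names for illustration, not Besu code.

```java
import java.io.PrintWriter;
import java.io.StringWriter;

final class StackTraceText {
  private StackTraceText() {}

  /** Renders the full stack trace of the throwable as a string. */
  static String of(final Throwable error) {
    final StringWriter buffer = new StringWriter();
    try (PrintWriter writer = new PrintWriter(buffer)) {
      error.printStackTrace(writer);
    }
    return buffer.toString();
  }

  /** Returns the message when it is present, otherwise the stack trace. */
  static String reasonOrTrace(final Throwable error) {
    final String message = error.getMessage();
    final boolean hasMessage = message != null && !message.trim().isEmpty();
    return hasMessage ? message : of(error);
  }

  public static void main(final String[] args) {
    // An exception with a message is reported by its message alone.
    System.out.println(reasonOrTrace(new IllegalStateException("Forbidden")));
    // An exception without a message is reported by its stack trace.
    System.out.println(reasonOrTrace(new IllegalStateException("")));
  }
}
```

Since the line is only emitted at debug level, the extra length of a stack trace would probably be acceptable there.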
Frequency: I still couldn't get peer discovery working
Versions (Add all that apply)
cat /etc/*release
NAME="Amazon Linux"
VERSION="2"
ID="amzn"
ID_LIKE="centos rhel fedora"
VERSION_ID="2"
PRETTY_NAME="Amazon Linux 2"
ANSI_COLOR="0;33"
CPE_NAME="cpe:2.3:o:amazon:amazon_linux:2"
HOME_URL="https://amazonlinux.com/"
Amazon Linux release 2 (Karoo)
uname -a
Linux ip-172-24-26-123.sa-east-1.compute.internal 5.4.186-102.354.amzn2.x86_64 #1 SMP Tue Mar 29 23:49:00 UTC 2022 x86_64 x86_64 x86_64 GNU/Linux
docker version
Version: 20.10.13
API version: 1.41
Go version: go1.16.15
Git commit: a224086
Built: Thu Mar 31 19:20:32 2022
OS/Arch: linux/amd64
Context: default
Experimental: true
Smart contract information (If you're reporting an issue arising from deploying or calling a smart contract, please supply related information)
solc --version
Additional Information (Add any of the following or anything else that may be relevant)