[Bug] [seatunnel-engine-storage] map and checkpoint Writing HDFS Kerberos tickets with automatic 24-hour expiration #7102
Comments
@liunaijie Could you please take a look?
Hi, please attach all the code you changed, or give the repo link.
I ran into the same problem.
@liunaijie Refreshing the FileSystem does not solve this problem.
This is an issue. I will take a look.
@shenzhy5 @weipengfei-sj I created a PR to fix this. Can you help cherry-pick this commit and verify?
Search before asking
What happened
I am using SeaTunnel 2.3.5 in cluster mode with 3 nodes.
The hazelcast.yaml configuration is as follows:
map:
  engine*:
    map-store:
      enabled: true
      initial-mode: EAGER
      factory-class-name: org.apache.seatunnel.engine.server.persistence.FileMapStoreFactory
      properties:
        type: hdfs
        namespace: /tmp/seatunnel/imap
        clusterName: seatunnel-cluster
        storage.type: hdfs
        fs.defaultFS: hdfs://fss:8020
        kerberosPrincipal: hdfs
        kerberosKeytabFilePath: /applinkis/ceph/share/hadoopcluster/fss/keytab/hdfs.keytab
        krb5Path: /app/linkis/seatunnel/config/krb5.conf
        seatunnel.hadoop.dfs.nameservices: fss
        seatunnel.hadoop.dfs.ha.namenodes.fss: nn1,nn2
        seatunnel.hadoop.dfs.namenode.rpc-address.fss.nn1: nn1:8020
        seatunnel.hadoop.dfs.namenode.rpc-address.fss.nn2: nn2:8020
        seatunnel.hadoop.dfs.client.failover.proxy.provider.usdp-bing: org.apache.hadoop.hdfs.server.namenode.ha.ConfiguredFailoverProxyProvider
        seatunnel.hadoop.dfs.namenode.kerberos.principal: nn/_HOST@T1.COM
        seatunnel.hadoop.dfs.datanode.kerberos.principal: dn/_HOST@T1.COM
        seatunnel.hadoop.rpc.protection: authentication
        seatunnel.hadoop.security.authentication: kerberos
        hdfs_site_path: /applinkis/ceph/share/hadoopcluster/fss/hadoop/hdfs-site.xml
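The map data is configured to be written to HDFS. After the cluster has been running for more than 24 hours, the service logs show that writes to HDFS fail because the Kerberos ticket has expired.
My analysis of the source code is as follows:
If HDFS is authenticated this way for writes and there is no logic to refresh the ticket automatically, the ticket will inevitably expire.
I tried modifying the code so that, after authentication, a scheduled task is started to refresh the ticket automatically:
However, even with this automatic Kerberos ticket refresh in place, after 24 hours the service still reports that the ticket is unavailable when writing to HDFS.
I also tried several other places, for example adding the same automatic ticket refresh to the HdfsWriter class, but none of them took effect. Could the community experts please point out what I am doing wrong? Thank you very much.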
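For readers following along, the kind of scheduled refresh described above could be sketched roughly as below. This is not the code from this report and not SeaTunnel's implementation. `TicketRefresher` and `Relogin` are hypothetical names introduced only for illustration, and the re-login action is passed in as a callback so the sketch compiles without Hadoop on the classpath.

```java
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

/** Hypothetical helper: runs a Kerberos re-login action at a fixed interval. */
final class TicketRefresher implements AutoCloseable {

    /** Hypothetical callback type wrapping whatever call performs the re-login. */
    interface Relogin {
        void run() throws Exception;
    }

    private final ScheduledExecutorService scheduler;
    private final AtomicInteger successCount = new AtomicInteger();

    TicketRefresher(Relogin relogin, long period, TimeUnit unit) {
        // A single daemon thread, so the refresher never blocks JVM shutdown.
        this.scheduler = Executors.newSingleThreadScheduledExecutor(r -> {
            Thread t = new Thread(r, "kerberos-ticket-refresher");
            t.setDaemon(true);
            return t;
        });
        this.scheduler.scheduleAtFixedRate(() -> {
            try {
                relogin.run();
                successCount.incrementAndGet();
            } catch (Exception e) {
                // Catch everything: an exception escaping the task would make
                // scheduleAtFixedRate cancel all later runs.
                System.err.println("Kerberos re-login failed: " + e);
            }
        }, period, period, unit);
    }

    /** Number of re-login attempts that completed without throwing. */
    int successCount() {
        return successCount.get();
    }

    @Override
    public void close() {
        scheduler.shutdownNow();
    }
}
```

In a Hadoop deployment the callback would presumably be something like `() -> UserGroupInformation.getLoginUser().checkTGTAndReloginFromKeytab()`, with the period set well below the ticket lifetime. As reported above, a refresh of this kind did not by itself resolve the failure in this case, so treat it as an illustration of the attempt rather than a fix.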
SeaTunnel Version
2.3.5
SeaTunnel Config
Running Command
Error Exception
Zeta or Flink or Spark Version
No response
Java or Scala Version
No response
Screenshots
No response
Are you willing to submit PR?
Code of Conduct