Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature Request: Optionally store Audio and Transcription under 'yyyy/mm/dd/hh' #40

Open
PeifferWM opened this issue Oct 15, 2024 · 0 comments

Comments

@PeifferWM
Copy link

Hello,

Currently the WAV files are stored in S3 at the root of the bucket.

client.put_object(Body=wav_bytes.getvalue(), Bucket=f'{os.getenv("s3_recordings_bucket")}', Key=f'{self.contact_id}.wav', Tagging=self.encoded_tags, ContentType='audio/x-wav')

We have an 11 year retention period in place for all files related to Amazon Connect, so these buckets become unmanageable very quickly.
By environment variable, it would be good to store these files similar to how Kinesis Firehose stores in S3, under yyyy/mm/dd/hh/{self.contact_id}.wav

I can make the change to the code after we pull it down, but that makes upgrades challenging. I'm not a Python coder or I'd offer to do a PR. My tweak will be hacky at best. :)

The same environment variable should also ensure the transcriptions are stored in the same location.

Thanks!

@PeifferWM PeifferWM changed the title Feature Request: Partitioned Storage of WAV Feature Request: Optionally store Audio and Transcription under 'yyyy/mm/dd/hh' Oct 15, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant