Create on-the-fly index column if you don't explicitly specify one #36

friederschueler · 2018-04-20T12:50:02Z

I wonder if you could point me to a starting point, how to implement this and if there are any caveats to think of.

Problem: I generate csv files from a database view and they don't have a unique identifier which could be used as index.
Idea: use the line number of the current row - 1 as index. (like adding a virtual colum in the csv)

With the current implementation this use-case will fail silently, as no changes are reported:
from csvdiff import *
diff_files("e.txt", "f.txt", [], ";")
I would like to implement this functionality and provide a pull request for this feature if you think that is a good idea.

e.txt
f.txt
I had to rename the files to .txt as github doesn't support .csv

The text was updated successfully, but these errors were encountered:

karakutu001 · 2018-08-10T15:13:49Z

I have the same problem. Did you resolve this problem?

I would like to compare two different csv. but unfortunately, csvdiff can't find all the line which are changed.

friederschueler · 2018-08-15T07:57:02Z

@karakutu001 No, I needed a quick fix and it I just prefixed my data files with a "rowID" column and used that. But you are more than welcome to provide some code. I am busy right now, but I could help with it in the next weeks.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create on-the-fly index column if you don't explicitly specify one #36

Create on-the-fly index column if you don't explicitly specify one #36

friederschueler commented Apr 20, 2018

karakutu001 commented Aug 10, 2018

friederschueler commented Aug 15, 2018

Create on-the-fly index column if you don't explicitly specify one #36

Create on-the-fly index column if you don't explicitly specify one #36

Comments

friederschueler commented Apr 20, 2018

karakutu001 commented Aug 10, 2018

friederschueler commented Aug 15, 2018