
I'm working with data in CSV format and want to set every empty cell to an empty string.

The problem I'm facing is that these files have been manipulated by several people in different environments, so the cells contain a variety of junk values, such as:

' '
'NaN'
'nan'
'\n'
'   '

And so on.

I'm looking for a standard way to identify all of these types of "junk values."


4 Answers


Use .strip() to remove surrounding whitespace, then check whether the stripped value is one you want to ignore:

if value.strip() in ['', 'NaN', 'nan']:
    value = ''  # treat this cell as empty

Or, make it case-insensitive:

if value.strip().lower() in ['', 'nan']:
    value = ''  # treat this cell as empty
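For instance, applying that check while reading a file with the csv module might look something like this ('data.csv' and 'clean.csv' are just placeholder file names):

import csv

junk = {'', 'nan'}  # stripped, lowercased junk values

with open('data.csv', newline='') as src, open('clean.csv', 'w', newline='') as dst:
    reader = csv.reader(src)
    writer = csv.writer(dst)
    for row in reader:
        # normalize each cell: junk values become empty strings
        writer.writerow('' if cell.strip().lower() in junk else cell for cell in row)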

You can use str.isspace(), which catches whitespace-only values like ' ' and '\n', but it won't handle values like 'NaN' or 'nan'. There isn't really a standard way to deal with those, so in addition to isspace() I would also keep a blacklist, e.g.:

blacklist = ['NaN', 'nan'] # add more as needed

Then use isspace() plus your blacklist to filter out unwanted values.
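A small helper combining both checks could look like this (is_junk is just a name I made up):

def is_junk(value):
    # whitespace-only strings ('\n', '   ', ...) or blacklisted tokens count
    # as junk; note that ''.isspace() is False, so truly empty cells pass
    # through untouched (they are already the target value)
    return value.isspace() or value in blacklist

print(is_junk('\n'))    # True
print(is_junk('NaN'))   # True
print(is_junk('data'))  # False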


You could read the CSV into a pandas DataFrame and then use DataFrame.fillna().
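One caveat: fillna() only fills actual NaN values, so the junk strings have to be parsed as NaN first, e.g. via read_csv's na_values parameter. A minimal sketch ('data.csv' is a placeholder name):

import pandas as pd

# strings listed in na_values are treated as NaN on top of pandas' defaults,
# which already cover 'NaN', 'nan', and empty fields
df = pd.read_csv('data.csv', na_values=[' ', '   ', '\n'])

# fillna then turns every NaN cell into an empty string
df = df.fillna('')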


I think DataFrame.replace would be a good fit for your problem.

Here is some sample code:

import pandas as pd

# sample data containing several junk variants
dic = {'a': ['NAN', '', 'NaN'], 'b': ['', 'nan', '\n'], 'c': [1, '2', '3']}
df = pd.DataFrame(dic)

# every string in this list gets replaced with an empty string;
# note 'NAN' has to be listed too, since replace matches exact strings
replace_list = ['NAN', 'NaN', 'nan', '\n']
df_clean = df.replace(replace_list, '')
print(df_clean)

You can import CSV data into pandas and do the same thing.
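For example, reading with keep_default_na=False keeps the junk as literal strings so the same replace still matches ('data.csv' is a placeholder name):

# without keep_default_na=False, pandas would convert 'NaN'/'nan' to float
# NaN on read, and the string-based replace above would not match them
df = pd.read_csv('data.csv', keep_default_na=False)
df_clean = df.replace(replace_list, '')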

Hope it helps.
