I was wondering, in your esteemed experiences, what is preferable when it comes to cleaning data imports: After or Before the import?
For example, I import a series of excel files from various users and use those to update live table data in the Access Database via a SQL INNER JOIN/UPDATE query. However, a lot of this data is not suitable for flowing into the database for many various reasons. For example, statuses might have been left blank, or improper codes used. It is difficult for me to control validation for these different sources across users so my only option at the moment is to clean the data before the import.
Do you guys think it is better to clean the data pre import or post? Cleaning it post would involve querying as well as updating the data (probably with sql or VBA) whereas cleaning it pre-import would involve using vba/R/python/etc. to tear things up and set things straight before funneling it in. Thoughts?