Hmm... have I asked this question here before? LOL
My source data is in tables in PDFs, which I can use PowerQuery to read/parse/clean. That bit works fine. What I would really like to do is use a for each file loop in maybe SSIS and then inside the loop, read the "next" file, parse, and write the results to my SQL Server table. and trap for errors and move the processed files to a final folder. All of that would be trivial in SSIS, except that the PowerQuery source in SSIS doesn't seem to work properly. (Still not certified... I mean, it's only 2+ years old!)
Is there a way to do this? If I execute the PowerQuery from Excel, then I'd have to force the Excel file to requery to cause the PDF files to be re-read. How do I do that in SSIS? Then I can write the data somewhere... but maybe I did it wrong, because SSIS seems to ignore the data types in PowerQuery. Do I need a data type conversion step in there?
There's an add-in that lets you write your PowerQuery results to SQL Server, but it's a black box - you can't do anything with the "intermediate" results (so I wouldn't be able to remove something like redundant records before inserting, for example).
Is there a way to basically output the results into SQL server easily? I've seen a few crazy videos doing it, but the way they do it is nuts (like writing a weird query that will make PowerQuery do writeback).