Forum Discussion
Barbara_123
Feb 25, 2025Copper Contributor
from pdf to excel without losing its format or data changing positions
I have a pdf that I need to put into excel 365 and make a table. So I used power query. However, when it goes into excel it loses the format and the data in the first two columns changes place. I am not an expert in excel whatsoever. I know the basics, so this is baffling me. Also is there a way to repeat the date (bearing in mind there are several dates on the sheet) and the tour so it would be a date and a tour code in each line?
Thank you in advance for any help
original pdf
when in excel
3 Replies
Sort By
- insightsgeekBrass Contributor
Hi,
Follow these steps to clean and structure your data properly:
- Import the PDF Correctly
Open Excel 365 and go to Data → Get Data → From File → From PDF.
Select the PDF file and let Power Query extract the tables.
Choose the correct table (check the preview to ensure it captures the data properly).
Click Load To... and select Power Query Editor instead of loading directly into Excel. - Fix Column Shifting Issue in Power Query
Identify if the Date or Tour columns have missing values, which might be causing misalignment.
Ensure that the first row is properly set as headers:
Go to Transform → Use First Row as Headers.
Manually rename columns if necessary (Home → Transform → Rename Columns).
Reorder columns if needed by dragging them into place in Power Query. - Fill Down Dates and Tour Codes in Power Query
Since Dates and Tour Codes only appear once per section in the original PDF, they need to be repeated for every row.
Select the Date column → Click Transform → Fill Down.
Select the Tour column → Click Transform → Fill Down. - Ensure Data is in Table Format
Click "Close & Load" to bring the cleaned data back into Excel.
If not automatically formatted as a table, select the data → Press Ctrl + T to convert it into a table. - Tips for Formatting
If numbers appear as text:
Select the column → Transform → Detect Data Type → Set to "Whole Number" or "Decimal Number".
If blank rows appear:
Use Remove Blank Rows under the Home tab in Power Query.This should maintain your data’s structure and allow easy analysis in Excel. Let me know if you need further clarification! 😊
- Barbara_123Copper Contributor
Thank you for your reply.... I was trying what you said and all goes well until click load to.... WHat it shows is this: which should I chose? thank you
- Import the PDF Correctly