Forum Discussion

Noone_Important
Copper Contributor
Aug 03, 2023

How to remove capital C when followed by a space then return

The document I am working on was OCR'd from a very large set of paper files (9,000 pages), and each page had holes punched into it.

During the OCR, the holes often came up as a capital letter C followed by a space and a return (as shown below). If I just delete every capital C from the entire document, I screw up many words. Is there a way to delete a capital C only when it is immediately followed by a space, then a return?

 

I'm also hoping to remove cases where a return at the end of a line is followed by another return on the next line (in the image below, I just want the document to have one return, as in the first paragraph, rather than a return then another return, as in the three lines after it). Any help with this one too would be awesome.
Thanks in advance to all.

 

2 Replies

  • Noone_Important For the uppercase C followed by a space and a pilcrow (¶), use Find and Replace to search for

    C ^p

    and replace it with nothing.

    For the unwanted empty paragraphs, search for

    ^p^p

    and replace it with

    ^p
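For anyone who would rather clean the text outside Word, the same two replacements can be sketched in Python. This is only a rough equivalent, assuming the OCR output has been saved as plain text with `\n` line endings; the function name `clean_ocr_text` is made up for illustration. One difference worth noting: a single pass of `^p^p` to `^p` in Word only halves a run of empty paragraphs, so Replace All may need repeating, whereas the regex below collapses a whole run at once.

```python
import re


def clean_ocr_text(text: str) -> str:
    """Strip 'C ' hole-punch artifacts and collapse repeated line breaks."""
    # Equivalent of finding "C ^p" and replacing with nothing:
    # a capital C, one space, then the line break are all removed.
    text = re.sub(r"C \n", "", text)
    # Equivalent of repeating "^p^p" -> "^p" until nothing changes:
    # any run of two or more line breaks becomes a single one.
    text = re.sub(r"\n{2,}", "\n", text)
    return text


if __name__ == "__main__":
    sample = "First line\nC \n\nSecond line\n\n\nThird Cat\n"
    print(repr(clean_ocr_text(sample)))
    # -> 'First line\nSecond line\nThird Cat\n'
```

As in Word, the first pattern would also hit a genuine word ending in a capital C that happens to sit right before a space and a line break, so it may be worth spot-checking a few matches first.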

    • Noone_Important
      Copper Contributor
      Thanks so much. That worked perfectly. Given you seem to be the bulk-replace guru, here is a tougher one.
      I want to add a Next Page section break every time the document has "/86" (without the quotes). This is because the final part of each page is a day in 1986 (written in the text as "xx/xx/86"), and I want a Next Page section break added there.
      Word doesn't seem to allow me to do this using special characters.
      Can it be done?
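To make the matching part of this request concrete, here is a rough plain-text sketch in Python. It does not create a real Word section break: it assumes the text has been exported as plain text and uses a form feed character (`\f`) as a stand-in page break. The name `add_break_after_dates` and the reading of "xx" as one or two digits are assumptions for illustration only.

```python
import re

# Form feed used as a plain-text stand-in for a page break (assumption).
PAGE_BREAK = "\f"

# A date written as xx/xx/86, where each xx is assumed to be 1-2 digits.
DATE_1986 = re.compile(r"\b\d{1,2}/\d{1,2}/86\b")


def add_break_after_dates(text: str) -> str:
    """Insert a page-break marker right after every xx/xx/86 date."""
    return DATE_1986.sub(lambda m: m.group(0) + PAGE_BREAK, text)


if __name__ == "__main__":
    sample = "Entry ends 03/08/86\nNext page starts here"
    print(repr(add_break_after_dates(sample)))
```

Matching the full date rather than a bare "/86" avoids adding breaks after unrelated text such as a fraction or a reference number that happens to end in "/86".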
