Using PHP to convert tabs to spaces for HTML display?

Question

I need to display, in a web page, a plain-text file that contains columns of data separated by tabs (with two-space tab stops).

I used PHP to read the text file and print it between <pre> tags so that it is shown in a monospace font, like so:

<pre>
<?php
  $fn="data.txt";
  $fi=fopen($fn, "r");
  $fc=fread($fi, filesize($fn));         //open and read text file
  fclose($fi);
  $fc=str_replace("\t", "  ", $fc);      //replace tabs with two spaces
  print($fc);                            //print data between PRE tags
?>
</pre>

It almost works, but the tabs are troublesome. It is trivial to replace each tab with two spaces, but then the non-whitespace characters are pushed over instead of being absorbed into the tabs. A true tab absorbs up to n-1 of the non-whitespace characters before it (where n is the number of spaces per tab), because it only advances to the next tab stop.
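Stated as arithmetic: a tab that starts at zero-based column c, with tab stops every n columns, is as wide as n - (c mod n) spaces, which is always between 1 and n. A minimal sketch of that rule, where tabWidth is a hypothetical helper name used only for illustration:

```php
<?php
// Hypothetical helper: number of spaces a tab occupies when it starts at
// zero-based column $col, with tab stops every $n columns.
function tabWidth(int $col, int $n): int {
    return $n - ($col % $n);
}

// With two-column tab stops, a tab at an even column is two spaces wide,
// while a tab at an odd column is only one space wide.
echo tabWidth(0, 2), "\n"; // 2
echo tabWidth(1, 2), "\n"; // 1
echo tabWidth(3, 2), "\n"; // 1
```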

For example, the following table should be displayed like so:

|   | 43| 43|  7|   |   |
| 12|128|128|128|   | 53|
|  3|  3|  3|  3|   |   |
|   |   | 21| 21| 39|   |

However, by blindly replacing every tab with two spaces, we get this:

|    |  43|  43|    7|   |   |
|  12|128|128|128|   | 53|
|   3|   3|   3|   3|   |   |
|   |   |  21|  21|  39|   |

I’m trying to figure out a (reasonably easy) way to convert the tabs to spaces while accounting for tabs that don’t take up the full n spaces.

Unfortunately, I believe you need a second pass to accurately calculate the column's width.
– Alexander, Commented Jan 13, 2013 at 21:22
What is reasonably easy? It's not hard to do if you allow for some looping. I'm thinking str_pad.
– Daniel Figueroa, Commented Jan 13, 2013 at 21:29
You could probably achieve this effect with preg_replace_callback and printf; read up on those two.
– Madara's Ghost, Commented Jan 13, 2013 at 21:33
@MadaraUchiha, printf is a great idea, but the callback receives an array of matches; in other words, it would get an array of tabs without the non-whitespace characters that come after them. I suppose it might be possible to parse using the column delimiters (assuming they exist, as in this particular case). Of course, at that point it would probably be easier to process the file line by line instead of doing a string replace.
– Synetech, Commented Jan 13, 2013 at 22:44

dev-null-dweller · Accepted Answer · 2013-01-13 22:48:39Z

6

I wrote this function some time ago; it might be helpful:

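function tab2space($line, $tab = 4, $nbsp = FALSE) {
    // With $nbsp, pad with a placeholder (BEL) so that only the padding
    // produced by tabs, not existing spaces, is turned into &nbsp;
    $pad = $nbsp ? chr(7) : ' ';
    while (($t = mb_strpos($line, "\t")) !== FALSE) {
        $preTab = mb_substr($line, 0, $t);
        // Pad up to the next tab stop (1 to $tab characters)
        $line = $preTab . str_repeat($pad, $tab - ($t % $tab)) . mb_substr($line, $t + 1);
    }
    return $nbsp ? str_replace($pad, '&nbsp;', $line) : $line;
}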

It was meant to deal with multibyte strings. If your data is single-byte (for example, only numbers), you can drop the mb_ prefixes, which will speed the function up.

[+] Note that this is meant to work on a single line, so you will need to process the file line by line with fgets instead of the whole file at once.
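A minimal sketch of that line-by-line processing, assuming single-byte data (so the mb_ prefixes are dropped, as mentioned above) and a hypothetical wrapper named expandFile that is not part of the original function:

```php
<?php
// Single-byte copy of tab2space, repeated so this sketch runs on its own.
function tab2space($line, $tab = 4, $nbsp = false) {
    $pad = $nbsp ? chr(7) : ' ';
    while (($t = strpos($line, "\t")) !== false) {
        // Pad from the tab's column up to the next tab stop.
        $line = substr($line, 0, $t) . str_repeat($pad, $tab - ($t % $tab)) . substr($line, $t + 1);
    }
    return $nbsp ? str_replace($pad, '&nbsp;', $line) : $line;
}

// Hypothetical helper: expand the tabs in a file one line at a time.
function expandFile($path, $tab = 4) {
    $out = '';
    $fh = fopen($path, 'r');
    if ($fh === false) {
        return $out; // nothing could be read
    }
    while (($line = fgets($fh)) !== false) {
        // fgets keeps the trailing newline; strip it before expanding
        // and add it back afterwards.
        $out .= tab2space(rtrim($line, "\r\n"), $tab) . "\n";
    }
    fclose($fh);
    return $out;
}

// Usage (assumes data.txt from the question, with two-column tab stops):
// echo htmlspecialchars(expandFile('data.txt', 2));
```

Because each call starts counting columns at zero, every line gets its own tab stops, which is what a <pre> block needs.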

edited Jan 13, 2013 at 22:48

answered Jan 13, 2013 at 22:03


1 Comment

Synetech Over a year ago

[+] Note that this is meant to work with one line, so you will need to process line by line with fgets instead of whole file at once.

Ah, that must be why lines (after the first one) that start with a double-tab are short one space. No problem, it was simple enough to fix:

$lines = explode("\n", $fc);
foreach ($lines as $l) print(tab2space($l) . "\n");

Pierrickouw · Answer · 2013-01-13 21:29:15Z

1

You can try to use printf function.

Here is an example:

printf("%4d", '37');  // prints '  37' (2 spaces before 37)
printf("%6d", '37');  // prints '    37' (4 spaces before 37)
printf("%6d", '337'); // prints '   337' (3 spaces before 337)

More information about the format string is in the PHP manual's printf documentation.

(For your information, the same trick is available with C)
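To apply this kind of padding to the question's data, the values first have to be separated from the surrounding whitespace. One possible sketch, assuming every cell is delimited by | as in the sample table, splits each line on the pipes, trims each cell, and right-aligns it with sprintf. The helper name formatRow is hypothetical, and %s is used instead of %d so that empty cells stay blank rather than becoming 0:

```php
<?php
// Hypothetical helper: right-align every |-delimited cell to $width characters.
// Assumes the line starts and ends with a pipe, as in the question's sample.
function formatRow($line, $width = 3) {
    // Split on the pipes, then drop the empty pieces outside the outer pipes.
    $cells = array_slice(explode('|', trim($line)), 1, -1);
    $out = '|';
    foreach ($cells as $cell) {
        // Remove the tabs and spaces around the value, then pad on the left.
        $out .= sprintf("%{$width}s", trim($cell)) . '|';
    }
    return $out;
}

echo formatRow("|\t|\t43|\t43|\t7|\t|\t|"), "\n";
// |   | 43| 43|  7|   |   |
```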

answered Jan 13, 2013 at 21:29


2 Comments

Julien Schmidt Over a year ago

But you must remove the tabs first (e.g with $fc = str_replace("\t", "", $fc);).

Synetech Over a year ago

"You can try to use printf function.: printf("%d", '37');" But how would you extract the value?

"(For your information, the same trick is available with C)" Something tells me I know that already. ;-)

AlienHoboken · Answer · 2013-01-13 21:46:51Z

0

First, get rid of all tabs and spaces:

$fc = str_replace("\t", "", $fc);
$fc = str_replace(" ", "", $fc);

Then apply these replacements. The loops are needed because a single pass may not hit every case: two adjacent cells share a pipe, and matches cannot overlap, so the second cell is only caught on the next pass:

// deal with the case of two pipes next to each other
while (strpos($fc, "||") !== false)
    $fc = str_replace("||", "|   |", $fc);

// deal with the case of |XX|
while (preg_match('/\|[0-9][0-9]\|/', $fc) !== 0)
    $fc = preg_replace('/\|([0-9])([0-9])\|/', '| ${1}${2}|', $fc);

// deal with the case of |X|
while (preg_match('/\|([0-9])\|/', $fc) !== 0)
    $fc = preg_replace('/\|([0-9])\|/', '|  ${1}|', $fc);

Since your columns are three characters wide, nothing needs to be done for three-digit numbers (|XXX|).

This should work!
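The fixed replacements above are tied to three-character columns. A more general variant of the same idea, sketched here under the assumption that cells are always delimited by pipes, uses preg_replace_callback with str_pad so that the column width becomes a parameter. The name padCells is hypothetical:

```php
<?php
// Hypothetical helper: pad the content of each |-delimited cell to $width.
function padCells($text, $width = 3) {
    // (?<=\|) and (?=\|) are lookarounds, so the pipes themselves are not
    // consumed and adjacent cells can all be matched in a single pass.
    return preg_replace_callback(
        '/(?<=\|)[^|\n]*(?=\|)/',
        function ($m) use ($width) {
            // Strip tabs and spaces, then right-align the value.
            return str_pad(trim($m[0]), $width, ' ', STR_PAD_LEFT);
        },
        $text
    );
}

echo padCells("|\t|\t43|\t43|\t7|\t|\t|"), "\n";
// |   | 43| 43|  7|   |   |
```

Since the pattern excludes newlines, this works on the whole file contents at once and does not require removing the tabs and spaces beforehand.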

answered Jan 13, 2013 at 21:46


2 Comments

Synetech Over a year ago

Actually, I had already come up with a similar workaround. I used the (smaller) code below, and while it suits my current needs, it is specific to this one particular format. It would have to be changed manually for other column widths, delimiters, spaces per tab, etc., and would quickly become untenable.

$fc=str_replace("\n\t\t", "    ", $fc);
$fc=str_replace("\t\t", "   ", $fc);
$fc=str_replace("\t|", "  ", $fc);
$fc=str_replace("\t", " ", $fc);

AlienHoboken Over a year ago

Ah, I see. I was not aware you'd be working with other formats.
