How to skip replacing first occurrence of a character in each line?Check for, and add, missing timestamps to individual lines in a fileReplace a word after a significant line and white spaces (inline) using sed?Replace data at specific positions in txt file using data from another fileHow do I replace the last occurrence of a character in a string using sed?Compare 1st Column in 2 Files and Replace 3rd Column of File 1 with 4th Column of File 2How to search between the 2nd and 3rd delimitersDeleting lines by matching 3rd and 4th character onlyReplace second occurence between colon and bracketHow to replace last occurrence of pattern in a third last line of a fileWant to replace the value of 4th column of data2.csv with the 4th column of other csv file data1.csv where 3rd and 4th column are same in both files
Accidentally cashed a check twice
1980s (or earlier) book where people live a long time but they have short memories
Poisson distribution: why does time between events follow an exponential distribution?
Short story written from alien perspective with this line: "It's too bright to look at, so they don't"
Is the capacitor drawn or wired wrongly?
Who operates delivery flights for commercial airlines?
Metal bar on DMM PCB
What do we gain with higher order logics?
If Boris Johnson were prosecuted and convicted of lying about Brexit, can that be used to cancel Brexit?
How to make thick Asian sauces?
Why is Colorado so different politically from nearby states?
Comma Code - Ch. 4 Automate the Boring Stuff
What flavor of zksnark in tezos
Bent spoke design wheels — feasible?
What happened to all the nuclear material being smuggled after the fall of the USSR?
How do you build a story from a world?
How were concentration and extermination camp guards recruited?
Riley's, assemble!
My coworkers think I had a long honeymoon. Actually I was diagnosed with cancer. How do I talk about it?
Responsibility for visa checking
Could the Missouri River be running while Lake Michigan was frozen several meters deep?
Accidentally renamed tar.gz file to a non tar.gz file, will my file be messed up
How to decline physical affection from a child whose parents are pressuring them?
Java 8: How to convert String to Map<String,List<String>>?
How to skip replacing first occurrence of a character in each line?
Check for, and add, missing timestamps to individual lines in a fileReplace a word after a significant line and white spaces (inline) using sed?Replace data at specific positions in txt file using data from another fileHow do I replace the last occurrence of a character in a string using sed?Compare 1st Column in 2 Files and Replace 3rd Column of File 1 with 4th Column of File 2How to search between the 2nd and 3rd delimitersDeleting lines by matching 3rd and 4th character onlyReplace second occurence between colon and bracketHow to replace last occurrence of pattern in a third last line of a fileWant to replace the value of 4th column of data2.csv with the 4th column of other csv file data1.csv where 3rd and 4th column are same in both files
.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty margin-bottom:0;
I have some files in the format
Y15-SUB-B04-P17-BK_M02734_4_000000000-ANNUF_1_1111_24724_4878;size=1;
Y15-SUB-B05-P22-LM_M02734_4_000000000-ANNUF_1_1111_20624_14973;size=1;
Y15-SUB-B05-P22-LM_M02734_4_000000000-ANNUF_1_1103_11326_10379;size=1;
I wish to replace every occurrence of the underscore (_) with a colon (:) EXCEPT for the first one. I want an output like this:
Y15-SUB-B04-P17-BK_M02734:4:000000000-ANNUF:1:1111:24724:4878;size=1;
Y15-SUB-B05-P22-LM_M02734:4:000000000-ANNUF:1:1111:20624:14973;size=1;
Y15-SUB-B05-P22-LM_M02734:4:000000000-ANNUF:1:1103:11326:10379;size=1;
I know I can use sed -i '' 's/_/:/g' old_file
to replace ALL (or sed 's/_/:/g' old_file > new_file
), and that I could add numbers to replace only the 2nd, 4th or so occurrence:
sed 's/_/:/2' old_file > new_file
But how to replace every occurrence on each line BUT the first?
text-processing sed
add a comment |
I have some files in the format
Y15-SUB-B04-P17-BK_M02734_4_000000000-ANNUF_1_1111_24724_4878;size=1;
Y15-SUB-B05-P22-LM_M02734_4_000000000-ANNUF_1_1111_20624_14973;size=1;
Y15-SUB-B05-P22-LM_M02734_4_000000000-ANNUF_1_1103_11326_10379;size=1;
I wish to replace every occurrence of the underscore (_) with a colon (:) EXCEPT for the first one. I want an output like this:
Y15-SUB-B04-P17-BK_M02734:4:000000000-ANNUF:1:1111:24724:4878;size=1;
Y15-SUB-B05-P22-LM_M02734:4:000000000-ANNUF:1:1111:20624:14973;size=1;
Y15-SUB-B05-P22-LM_M02734:4:000000000-ANNUF:1:1103:11326:10379;size=1;
I know I can use sed -i '' 's/_/:/g' old_file
to replace ALL (or sed 's/_/:/g' old_file > new_file
), and that I could add numbers to replace only the 2nd, 4th or so occurrence:
sed 's/_/:/2' old_file > new_file
But how to replace every occurrence on each line BUT the first?
text-processing sed
Maybe a two step process? Replace all underscores with colons, then replace the first colon with an underscore?
– 0xSheepdog
11 hours ago
1
Sure, that would work, only my file is some 14+ gb, and each replacement process takes about 1 hour, so if there were just one parsing step, that would be preferable. Thank you though.
– Christoffer Bugge Harder
11 hours ago
1
Ah, so. Good info to have in the question, then. Things like requirements and limitations help us consider the entire situation.
– 0xSheepdog
11 hours ago
14+GB/1hour??? If it were me, I'd write a small (it would be very small) C program to do this rather than sed. I'd guess that would run in <5min.
– John Forkosh
1 hour ago
add a comment |
I have some files in the format
Y15-SUB-B04-P17-BK_M02734_4_000000000-ANNUF_1_1111_24724_4878;size=1;
Y15-SUB-B05-P22-LM_M02734_4_000000000-ANNUF_1_1111_20624_14973;size=1;
Y15-SUB-B05-P22-LM_M02734_4_000000000-ANNUF_1_1103_11326_10379;size=1;
I wish to replace every occurrence of the underscore (_) with a colon (:) EXCEPT for the first one. I want an output like this:
Y15-SUB-B04-P17-BK_M02734:4:000000000-ANNUF:1:1111:24724:4878;size=1;
Y15-SUB-B05-P22-LM_M02734:4:000000000-ANNUF:1:1111:20624:14973;size=1;
Y15-SUB-B05-P22-LM_M02734:4:000000000-ANNUF:1:1103:11326:10379;size=1;
I know I can use sed -i '' 's/_/:/g' old_file
to replace ALL (or sed 's/_/:/g' old_file > new_file
), and that I could add numbers to replace only the 2nd, 4th or so occurrence:
sed 's/_/:/2' old_file > new_file
But how to replace every occurrence on each line BUT the first?
text-processing sed
I have some files in the format
Y15-SUB-B04-P17-BK_M02734_4_000000000-ANNUF_1_1111_24724_4878;size=1;
Y15-SUB-B05-P22-LM_M02734_4_000000000-ANNUF_1_1111_20624_14973;size=1;
Y15-SUB-B05-P22-LM_M02734_4_000000000-ANNUF_1_1103_11326_10379;size=1;
I wish to replace every occurrence of the underscore (_) with a colon (:) EXCEPT for the first one. I want an output like this:
Y15-SUB-B04-P17-BK_M02734:4:000000000-ANNUF:1:1111:24724:4878;size=1;
Y15-SUB-B05-P22-LM_M02734:4:000000000-ANNUF:1:1111:20624:14973;size=1;
Y15-SUB-B05-P22-LM_M02734:4:000000000-ANNUF:1:1103:11326:10379;size=1;
I know I can use sed -i '' 's/_/:/g' old_file
to replace ALL (or sed 's/_/:/g' old_file > new_file
), and that I could add numbers to replace only the 2nd, 4th or so occurrence:
sed 's/_/:/2' old_file > new_file
But how to replace every occurrence on each line BUT the first?
text-processing sed
text-processing sed
edited 14 mins ago
Inian
6,2301633
6,2301633
asked 11 hours ago
Christoffer Bugge HarderChristoffer Bugge Harder
333
333
Maybe a two step process? Replace all underscores with colons, then replace the first colon with an underscore?
– 0xSheepdog
11 hours ago
1
Sure, that would work, only my file is some 14+ gb, and each replacement process takes about 1 hour, so if there were just one parsing step, that would be preferable. Thank you though.
– Christoffer Bugge Harder
11 hours ago
1
Ah, so. Good info to have in the question, then. Things like requirements and limitations help us consider the entire situation.
– 0xSheepdog
11 hours ago
14+GB/1hour??? If it were me, I'd write a small (it would be very small) C program to do this rather than sed. I'd guess that would run in <5min.
– John Forkosh
1 hour ago
add a comment |
Maybe a two step process? Replace all underscores with colons, then replace the first colon with an underscore?
– 0xSheepdog
11 hours ago
1
Sure, that would work, only my file is some 14+ gb, and each replacement process takes about 1 hour, so if there were just one parsing step, that would be preferable. Thank you though.
– Christoffer Bugge Harder
11 hours ago
1
Ah, so. Good info to have in the question, then. Things like requirements and limitations help us consider the entire situation.
– 0xSheepdog
11 hours ago
14+GB/1hour??? If it were me, I'd write a small (it would be very small) C program to do this rather than sed. I'd guess that would run in <5min.
– John Forkosh
1 hour ago
Maybe a two step process? Replace all underscores with colons, then replace the first colon with an underscore?
– 0xSheepdog
11 hours ago
Maybe a two step process? Replace all underscores with colons, then replace the first colon with an underscore?
– 0xSheepdog
11 hours ago
1
1
Sure, that would work, only my file is some 14+ gb, and each replacement process takes about 1 hour, so if there were just one parsing step, that would be preferable. Thank you though.
– Christoffer Bugge Harder
11 hours ago
Sure, that would work, only my file is some 14+ gb, and each replacement process takes about 1 hour, so if there were just one parsing step, that would be preferable. Thank you though.
– Christoffer Bugge Harder
11 hours ago
1
1
Ah, so. Good info to have in the question, then. Things like requirements and limitations help us consider the entire situation.
– 0xSheepdog
11 hours ago
Ah, so. Good info to have in the question, then. Things like requirements and limitations help us consider the entire situation.
– 0xSheepdog
11 hours ago
14+GB/1hour??? If it were me, I'd write a small (it would be very small) C program to do this rather than sed. I'd guess that would run in <5min.
– John Forkosh
1 hour ago
14+GB/1hour??? If it were me, I'd write a small (it would be very small) C program to do this rather than sed. I'd guess that would run in <5min.
– John Forkosh
1 hour ago
add a comment |
7 Answers
7
active
oldest
votes
Using GNU sed
(other versions may behave differently, thanks glenn jackman):
sed -i'' 's/_/:/2g' file
This will change all _
to :
skipping the first occurrence on each line.
This is GNU sed. Other sed implementations act differently when given both2
andg
(e.g. the BSD-derived sed on MacOS gives an error)
– glenn jackman
10 hours ago
add a comment |
Here is another simple awk
script,(standar Linux gawk
), no loops:
cat script.awk
match($0,/^[^_]*_/,a) # match current line to first _ (including) into a[0] variable
sub(a[0],""); # remove a[0] from current line
gsub("_",":"); # replace all _ to : in current line
print a[0]""$0; # ouput a[0] and current line
run:
awk -f script.awk input.txt
or:
awk 'match($0,/^[^_]*_/,a)sub(a[0],"");gsub("_",":");print a[0]""$0;' input.txt
add a comment |
Is awk okay? You could use _
as the field separator, and print out:
<field 1>_<field 2>:<field n>:<field n+1>:...
Like this:
awk -F_ ' printf("%s_%s", $1, $2); for (x = 3; x <=NF; x++) printf(":%s", $x); ; printf("n"); '
If the structure is the same for each line you could hard-code the number of fields to avoid the loop (runs in about 2/3 of the time according to a very rough preliminary trial):
awk -F_ 'printf("%s_%s:%s:%s:%s:%s:%s:%sn", $1, $2, $3, $4, $5, $6, $7, $8);'
Thank you so much - I am unfortunately almost helpless in AWK, so I would have to be spoon-fed the exact command to make it work. if the first file is file1, and the second the output file (file2), then what should I write exactly? Sorry for my ineptitude.
– Christoffer Bugge Harder
11 hours ago
Addfile1 > file2
to the end of the command, to have it read in file1 and write out to file2. Just like you would withsed
-- both parse text a line at a time!
– drewbenn
11 hours ago
Note the sed commands in the other answers seem to be faster than either of these commands.
– drewbenn
11 hours ago
add a comment |
Simple sed command will work fine for this
command:sed "s/_/:/2g" filename
output
Y15-SUB-B04-P17-BK_M02734:4:000000000-ANNUF:1:1111:24724:4878;size=1;
Y15-SUB-B05-P22-LM_M02734:4:000000000-ANNUF:1:1111:20624:14973;size=1;
Y15-SUB-B05-P22-LM_M02734:4:000000000-ANNUF:1:1103:11326:10379;size=1;
Note:Suppose if you want to replace in same file use below command
sed -i "s/_/:/2g" filename
This gives me this error message: > sed: 1: "s/:/_/2g": more than one number or 'g' in substitute flags
– Christoffer Bugge Harder
9 hours ago
add a comment |
With perl
, to match the character _
and replace from the first instance on-wards as below.
perl -pe '$n=0s_++$n > 1 ? ":" : $&;ge' file
The part s_
identifies the _
within the line and if its the 2nd occurrence replace with :
or replace with the same character($&
)
add a comment |
Using Posix-sed
constructs only we do like as:
$ sed -e '
y/_/n/
s/n/_/
y/n/:/
' inp.file
$ perl -pe '1 while s/_.*?K_/:/g' inp.file
add a comment |
As per my comment below your question, here's the (very) small C program that accomplishes the described functionality, and probably an order-of-magnitude (or two) faster than sed...
#include <stdio.h>
#include <string.h>
int main ( int argc, char *argv[] )
FILE *infp = (argc>1? fopen(argv[1],"r") : stdin ),
*outfp = (argc>2? fopen(argv[2],"w") : stdout );
char line[9999], *lptr=line, this = '_', that = ':';
int nskip = 1, nfound = 0;
if ( infp != NULL && outfp != NULL )
while ( (lptr=fgets(line,9990,infp)) != NULL )
nfound = 0;
while ( (lptr=strchr(lptr,this)) != NULL )
if ( ++nfound > nskip ) *lptr = that;
lptr++;
if ( fputs(line,outfp) == EOF ) break;
/* --- end-of-while(fgets()!=NULL) --- */
if ( infp != stdin ) fclose(infp);
if ( outfp != stdout ) fclose(outfp);
/* --- end-of-if(in,outfp!=NULL) --- */
/* --- end-of-function main() --- */
(P.S. If that's your boat, and if you're ever in-or-around NYC Harbor and need crew, email me. I'm a recreational sailor myself.)
add a comment |
Your Answer
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "106"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f522203%2fhow-to-skip-replacing-first-occurrence-of-a-character-in-each-line%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
7 Answers
7
active
oldest
votes
7 Answers
7
active
oldest
votes
active
oldest
votes
active
oldest
votes
Using GNU sed
(other versions may behave differently, thanks glenn jackman):
sed -i'' 's/_/:/2g' file
This will change all _
to :
skipping the first occurrence on each line.
This is GNU sed. Other sed implementations act differently when given both2
andg
(e.g. the BSD-derived sed on MacOS gives an error)
– glenn jackman
10 hours ago
add a comment |
Using GNU sed
(other versions may behave differently, thanks glenn jackman):
sed -i'' 's/_/:/2g' file
This will change all _
to :
skipping the first occurrence on each line.
This is GNU sed. Other sed implementations act differently when given both2
andg
(e.g. the BSD-derived sed on MacOS gives an error)
– glenn jackman
10 hours ago
add a comment |
Using GNU sed
(other versions may behave differently, thanks glenn jackman):
sed -i'' 's/_/:/2g' file
This will change all _
to :
skipping the first occurrence on each line.
Using GNU sed
(other versions may behave differently, thanks glenn jackman):
sed -i'' 's/_/:/2g' file
This will change all _
to :
skipping the first occurrence on each line.
edited 10 hours ago
answered 11 hours ago
FreddyFreddy
3,9431417
3,9431417
This is GNU sed. Other sed implementations act differently when given both2
andg
(e.g. the BSD-derived sed on MacOS gives an error)
– glenn jackman
10 hours ago
add a comment |
This is GNU sed. Other sed implementations act differently when given both2
andg
(e.g. the BSD-derived sed on MacOS gives an error)
– glenn jackman
10 hours ago
This is GNU sed. Other sed implementations act differently when given both
2
and g
(e.g. the BSD-derived sed on MacOS gives an error)– glenn jackman
10 hours ago
This is GNU sed. Other sed implementations act differently when given both
2
and g
(e.g. the BSD-derived sed on MacOS gives an error)– glenn jackman
10 hours ago
add a comment |
Here is another simple awk
script,(standar Linux gawk
), no loops:
cat script.awk
match($0,/^[^_]*_/,a) # match current line to first _ (including) into a[0] variable
sub(a[0],""); # remove a[0] from current line
gsub("_",":"); # replace all _ to : in current line
print a[0]""$0; # ouput a[0] and current line
run:
awk -f script.awk input.txt
or:
awk 'match($0,/^[^_]*_/,a)sub(a[0],"");gsub("_",":");print a[0]""$0;' input.txt
add a comment |
Here is another simple awk
script,(standar Linux gawk
), no loops:
cat script.awk
match($0,/^[^_]*_/,a) # match current line to first _ (including) into a[0] variable
sub(a[0],""); # remove a[0] from current line
gsub("_",":"); # replace all _ to : in current line
print a[0]""$0; # ouput a[0] and current line
run:
awk -f script.awk input.txt
or:
awk 'match($0,/^[^_]*_/,a)sub(a[0],"");gsub("_",":");print a[0]""$0;' input.txt
add a comment |
Here is another simple awk
script,(standar Linux gawk
), no loops:
cat script.awk
match($0,/^[^_]*_/,a) # match current line to first _ (including) into a[0] variable
sub(a[0],""); # remove a[0] from current line
gsub("_",":"); # replace all _ to : in current line
print a[0]""$0; # ouput a[0] and current line
run:
awk -f script.awk input.txt
or:
awk 'match($0,/^[^_]*_/,a)sub(a[0],"");gsub("_",":");print a[0]""$0;' input.txt
Here is another simple awk
script,(standar Linux gawk
), no loops:
cat script.awk
match($0,/^[^_]*_/,a) # match current line to first _ (including) into a[0] variable
sub(a[0],""); # remove a[0] from current line
gsub("_",":"); # replace all _ to : in current line
print a[0]""$0; # ouput a[0] and current line
run:
awk -f script.awk input.txt
or:
awk 'match($0,/^[^_]*_/,a)sub(a[0],"");gsub("_",":");print a[0]""$0;' input.txt
answered 10 hours ago
Dudi BoyDudi Boy
29826
29826
add a comment |
add a comment |
Is awk okay? You could use _
as the field separator, and print out:
<field 1>_<field 2>:<field n>:<field n+1>:...
Like this:
awk -F_ ' printf("%s_%s", $1, $2); for (x = 3; x <=NF; x++) printf(":%s", $x); ; printf("n"); '
If the structure is the same for each line you could hard-code the number of fields to avoid the loop (runs in about 2/3 of the time according to a very rough preliminary trial):
awk -F_ 'printf("%s_%s:%s:%s:%s:%s:%s:%sn", $1, $2, $3, $4, $5, $6, $7, $8);'
Thank you so much - I am unfortunately almost helpless in AWK, so I would have to be spoon-fed the exact command to make it work. if the first file is file1, and the second the output file (file2), then what should I write exactly? Sorry for my ineptitude.
– Christoffer Bugge Harder
11 hours ago
Addfile1 > file2
to the end of the command, to have it read in file1 and write out to file2. Just like you would withsed
-- both parse text a line at a time!
– drewbenn
11 hours ago
Note the sed commands in the other answers seem to be faster than either of these commands.
– drewbenn
11 hours ago
add a comment |
Is awk okay? You could use _
as the field separator, and print out:
<field 1>_<field 2>:<field n>:<field n+1>:...
Like this:
awk -F_ ' printf("%s_%s", $1, $2); for (x = 3; x <=NF; x++) printf(":%s", $x); ; printf("n"); '
If the structure is the same for each line you could hard-code the number of fields to avoid the loop (runs in about 2/3 of the time according to a very rough preliminary trial):
awk -F_ 'printf("%s_%s:%s:%s:%s:%s:%s:%sn", $1, $2, $3, $4, $5, $6, $7, $8);'
Thank you so much - I am unfortunately almost helpless in AWK, so I would have to be spoon-fed the exact command to make it work. if the first file is file1, and the second the output file (file2), then what should I write exactly? Sorry for my ineptitude.
– Christoffer Bugge Harder
11 hours ago
Addfile1 > file2
to the end of the command, to have it read in file1 and write out to file2. Just like you would withsed
-- both parse text a line at a time!
– drewbenn
11 hours ago
Note the sed commands in the other answers seem to be faster than either of these commands.
– drewbenn
11 hours ago
add a comment |
Is awk okay? You could use _
as the field separator, and print out:
<field 1>_<field 2>:<field n>:<field n+1>:...
Like this:
awk -F_ ' printf("%s_%s", $1, $2); for (x = 3; x <=NF; x++) printf(":%s", $x); ; printf("n"); '
If the structure is the same for each line you could hard-code the number of fields to avoid the loop (runs in about 2/3 of the time according to a very rough preliminary trial):
awk -F_ 'printf("%s_%s:%s:%s:%s:%s:%s:%sn", $1, $2, $3, $4, $5, $6, $7, $8);'
Is awk okay? You could use _
as the field separator, and print out:
<field 1>_<field 2>:<field n>:<field n+1>:...
Like this:
awk -F_ ' printf("%s_%s", $1, $2); for (x = 3; x <=NF; x++) printf(":%s", $x); ; printf("n"); '
If the structure is the same for each line you could hard-code the number of fields to avoid the loop (runs in about 2/3 of the time according to a very rough preliminary trial):
awk -F_ 'printf("%s_%s:%s:%s:%s:%s:%s:%sn", $1, $2, $3, $4, $5, $6, $7, $8);'
edited 11 hours ago
answered 11 hours ago
drewbenndrewbenn
5,49451936
5,49451936
Thank you so much - I am unfortunately almost helpless in AWK, so I would have to be spoon-fed the exact command to make it work. if the first file is file1, and the second the output file (file2), then what should I write exactly? Sorry for my ineptitude.
– Christoffer Bugge Harder
11 hours ago
Addfile1 > file2
to the end of the command, to have it read in file1 and write out to file2. Just like you would withsed
-- both parse text a line at a time!
– drewbenn
11 hours ago
Note the sed commands in the other answers seem to be faster than either of these commands.
– drewbenn
11 hours ago
add a comment |
Thank you so much - I am unfortunately almost helpless in AWK, so I would have to be spoon-fed the exact command to make it work. if the first file is file1, and the second the output file (file2), then what should I write exactly? Sorry for my ineptitude.
– Christoffer Bugge Harder
11 hours ago
Addfile1 > file2
to the end of the command, to have it read in file1 and write out to file2. Just like you would withsed
-- both parse text a line at a time!
– drewbenn
11 hours ago
Note the sed commands in the other answers seem to be faster than either of these commands.
– drewbenn
11 hours ago
Thank you so much - I am unfortunately almost helpless in AWK, so I would have to be spoon-fed the exact command to make it work. if the first file is file1, and the second the output file (file2), then what should I write exactly? Sorry for my ineptitude.
– Christoffer Bugge Harder
11 hours ago
Thank you so much - I am unfortunately almost helpless in AWK, so I would have to be spoon-fed the exact command to make it work. if the first file is file1, and the second the output file (file2), then what should I write exactly? Sorry for my ineptitude.
– Christoffer Bugge Harder
11 hours ago
Add
file1 > file2
to the end of the command, to have it read in file1 and write out to file2. Just like you would with sed
-- both parse text a line at a time!– drewbenn
11 hours ago
Add
file1 > file2
to the end of the command, to have it read in file1 and write out to file2. Just like you would with sed
-- both parse text a line at a time!– drewbenn
11 hours ago
Note the sed commands in the other answers seem to be faster than either of these commands.
– drewbenn
11 hours ago
Note the sed commands in the other answers seem to be faster than either of these commands.
– drewbenn
11 hours ago
add a comment |
Simple sed command will work fine for this
command:sed "s/_/:/2g" filename
output
Y15-SUB-B04-P17-BK_M02734:4:000000000-ANNUF:1:1111:24724:4878;size=1;
Y15-SUB-B05-P22-LM_M02734:4:000000000-ANNUF:1:1111:20624:14973;size=1;
Y15-SUB-B05-P22-LM_M02734:4:000000000-ANNUF:1:1103:11326:10379;size=1;
Note:Suppose if you want to replace in same file use below command
sed -i "s/_/:/2g" filename
This gives me this error message: > sed: 1: "s/:/_/2g": more than one number or 'g' in substitute flags
– Christoffer Bugge Harder
9 hours ago
add a comment |
Simple sed command will work fine for this
command:sed "s/_/:/2g" filename
output
Y15-SUB-B04-P17-BK_M02734:4:000000000-ANNUF:1:1111:24724:4878;size=1;
Y15-SUB-B05-P22-LM_M02734:4:000000000-ANNUF:1:1111:20624:14973;size=1;
Y15-SUB-B05-P22-LM_M02734:4:000000000-ANNUF:1:1103:11326:10379;size=1;
Note:Suppose if you want to replace in same file use below command
sed -i "s/_/:/2g" filename
This gives me this error message: > sed: 1: "s/:/_/2g": more than one number or 'g' in substitute flags
– Christoffer Bugge Harder
9 hours ago
add a comment |
Simple sed command will work fine for this
command:sed "s/_/:/2g" filename
output
Y15-SUB-B04-P17-BK_M02734:4:000000000-ANNUF:1:1111:24724:4878;size=1;
Y15-SUB-B05-P22-LM_M02734:4:000000000-ANNUF:1:1111:20624:14973;size=1;
Y15-SUB-B05-P22-LM_M02734:4:000000000-ANNUF:1:1103:11326:10379;size=1;
Note:Suppose if you want to replace in same file use below command
sed -i "s/_/:/2g" filename
Simple sed command will work fine for this
command:sed "s/_/:/2g" filename
output
Y15-SUB-B04-P17-BK_M02734:4:000000000-ANNUF:1:1111:24724:4878;size=1;
Y15-SUB-B05-P22-LM_M02734:4:000000000-ANNUF:1:1111:20624:14973;size=1;
Y15-SUB-B05-P22-LM_M02734:4:000000000-ANNUF:1:1103:11326:10379;size=1;
Note:Suppose if you want to replace in same file use below command
sed -i "s/_/:/2g" filename
answered 11 hours ago
Praveen Kumar BSPraveen Kumar BS
1,9552311
1,9552311
This gives me this error message: > sed: 1: "s/:/_/2g": more than one number or 'g' in substitute flags
– Christoffer Bugge Harder
9 hours ago
add a comment |
This gives me this error message: > sed: 1: "s/:/_/2g": more than one number or 'g' in substitute flags
– Christoffer Bugge Harder
9 hours ago
This gives me this error message: > sed: 1: "s/:/_/2g": more than one number or 'g' in substitute flags
– Christoffer Bugge Harder
9 hours ago
This gives me this error message: > sed: 1: "s/:/_/2g": more than one number or 'g' in substitute flags
– Christoffer Bugge Harder
9 hours ago
add a comment |
With perl
, to match the character _
and replace from the first instance on-wards as below.
perl -pe '$n=0s_++$n > 1 ? ":" : $&;ge' file
The part s_
identifies the _
within the line and if its the 2nd occurrence replace with :
or replace with the same character($&
)
add a comment |
With perl
, to match the character _
and replace from the first instance on-wards as below.
perl -pe '$n=0s_++$n > 1 ? ":" : $&;ge' file
The part s_
identifies the _
within the line and if its the 2nd occurrence replace with :
or replace with the same character($&
)
add a comment |
With perl
, to match the character _
and replace from the first instance on-wards as below.
perl -pe '$n=0s_++$n > 1 ? ":" : $&;ge' file
The part s_
identifies the _
within the line and if its the 2nd occurrence replace with :
or replace with the same character($&
)
With perl
, to match the character _
and replace from the first instance on-wards as below.
perl -pe '$n=0s_++$n > 1 ? ":" : $&;ge' file
The part s_
identifies the _
within the line and if its the 2nd occurrence replace with :
or replace with the same character($&
)
answered 10 hours ago
InianInian
6,2301633
6,2301633
add a comment |
add a comment |
Using Posix-sed
constructs only we do like as:
$ sed -e '
y/_/n/
s/n/_/
y/n/:/
' inp.file
$ perl -pe '1 while s/_.*?K_/:/g' inp.file
add a comment |
Using Posix-sed
constructs only we do like as:
$ sed -e '
y/_/n/
s/n/_/
y/n/:/
' inp.file
$ perl -pe '1 while s/_.*?K_/:/g' inp.file
add a comment |
Using Posix-sed
constructs only we do like as:
$ sed -e '
y/_/n/
s/n/_/
y/n/:/
' inp.file
$ perl -pe '1 while s/_.*?K_/:/g' inp.file
Using Posix-sed
constructs only we do like as:
$ sed -e '
y/_/n/
s/n/_/
y/n/:/
' inp.file
$ perl -pe '1 while s/_.*?K_/:/g' inp.file
answered 9 hours ago
Rakesh SharmaRakesh Sharma
549126
549126
add a comment |
add a comment |
As per my comment below your question, here's the (very) small C program that accomplishes the described functionality, and probably an order-of-magnitude (or two) faster than sed...
#include <stdio.h>
#include <string.h>
int main ( int argc, char *argv[] )
FILE *infp = (argc>1? fopen(argv[1],"r") : stdin ),
*outfp = (argc>2? fopen(argv[2],"w") : stdout );
char line[9999], *lptr=line, this = '_', that = ':';
int nskip = 1, nfound = 0;
if ( infp != NULL && outfp != NULL )
while ( (lptr=fgets(line,9990,infp)) != NULL )
nfound = 0;
while ( (lptr=strchr(lptr,this)) != NULL )
if ( ++nfound > nskip ) *lptr = that;
lptr++;
if ( fputs(line,outfp) == EOF ) break;
/* --- end-of-while(fgets()!=NULL) --- */
if ( infp != stdin ) fclose(infp);
if ( outfp != stdout ) fclose(outfp);
/* --- end-of-if(in,outfp!=NULL) --- */
/* --- end-of-function main() --- */
(P.S. If that's your boat, and if you're ever in-or-around NYC Harbor and need crew, email me. I'm a recreational sailor myself.)
add a comment |
As per my comment below your question, here's the (very) small C program that accomplishes the described functionality, and probably an order-of-magnitude (or two) faster than sed...
#include <stdio.h>
#include <string.h>
int main ( int argc, char *argv[] )
FILE *infp = (argc>1? fopen(argv[1],"r") : stdin ),
*outfp = (argc>2? fopen(argv[2],"w") : stdout );
char line[9999], *lptr=line, this = '_', that = ':';
int nskip = 1, nfound = 0;
if ( infp != NULL && outfp != NULL )
while ( (lptr=fgets(line,9990,infp)) != NULL )
nfound = 0;
while ( (lptr=strchr(lptr,this)) != NULL )
if ( ++nfound > nskip ) *lptr = that;
lptr++;
if ( fputs(line,outfp) == EOF ) break;
/* --- end-of-while(fgets()!=NULL) --- */
if ( infp != stdin ) fclose(infp);
if ( outfp != stdout ) fclose(outfp);
/* --- end-of-if(in,outfp!=NULL) --- */
/* --- end-of-function main() --- */
(P.S. If that's your boat, and if you're ever in-or-around NYC Harbor and need crew, email me. I'm a recreational sailor myself.)
add a comment |
As per my comment below your question, here's the (very) small C program that accomplishes the described functionality, and probably an order-of-magnitude (or two) faster than sed...
#include <stdio.h>
#include <string.h>
int main ( int argc, char *argv[] )
FILE *infp = (argc>1? fopen(argv[1],"r") : stdin ),
*outfp = (argc>2? fopen(argv[2],"w") : stdout );
char line[9999], *lptr=line, this = '_', that = ':';
int nskip = 1, nfound = 0;
if ( infp != NULL && outfp != NULL )
while ( (lptr=fgets(line,9990,infp)) != NULL )
nfound = 0;
while ( (lptr=strchr(lptr,this)) != NULL )
if ( ++nfound > nskip ) *lptr = that;
lptr++;
if ( fputs(line,outfp) == EOF ) break;
/* --- end-of-while(fgets()!=NULL) --- */
if ( infp != stdin ) fclose(infp);
if ( outfp != stdout ) fclose(outfp);
/* --- end-of-if(in,outfp!=NULL) --- */
/* --- end-of-function main() --- */
(P.S. If that's your boat, and if you're ever in-or-around NYC Harbor and need crew, email me. I'm a recreational sailor myself.)
As per my comment below your question, here's the (very) small C program that accomplishes the described functionality, and probably an order-of-magnitude (or two) faster than sed...
#include <stdio.h>
#include <string.h>
int main ( int argc, char *argv[] )
FILE *infp = (argc>1? fopen(argv[1],"r") : stdin ),
*outfp = (argc>2? fopen(argv[2],"w") : stdout );
char line[9999], *lptr=line, this = '_', that = ':';
int nskip = 1, nfound = 0;
if ( infp != NULL && outfp != NULL )
while ( (lptr=fgets(line,9990,infp)) != NULL )
nfound = 0;
while ( (lptr=strchr(lptr,this)) != NULL )
if ( ++nfound > nskip ) *lptr = that;
lptr++;
if ( fputs(line,outfp) == EOF ) break;
/* --- end-of-while(fgets()!=NULL) --- */
if ( infp != stdin ) fclose(infp);
if ( outfp != stdout ) fclose(outfp);
/* --- end-of-if(in,outfp!=NULL) --- */
/* --- end-of-function main() --- */
(P.S. If that's your boat, and if you're ever in-or-around NYC Harbor and need crew, email me. I'm a recreational sailor myself.)
answered 46 mins ago
John ForkoshJohn Forkosh
1208
1208
add a comment |
add a comment |
Thanks for contributing an answer to Unix & Linux Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f522203%2fhow-to-skip-replacing-first-occurrence-of-a-character-in-each-line%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Maybe a two step process? Replace all underscores with colons, then replace the first colon with an underscore?
– 0xSheepdog
11 hours ago
1
Sure, that would work, only my file is some 14+ gb, and each replacement process takes about 1 hour, so if there were just one parsing step, that would be preferable. Thank you though.
– Christoffer Bugge Harder
11 hours ago
1
Ah, so. Good info to have in the question, then. Things like requirements and limitations help us consider the entire situation.
– 0xSheepdog
11 hours ago
14+GB/1hour??? If it were me, I'd write a small (it would be very small) C program to do this rather than sed. I'd guess that would run in <5min.
– John Forkosh
1 hour ago