Word frequency: Difference between revisions

Word frequency (view source)

83 bytes added , 1 year ago

→‎{{header|Raku}}: use % operator, add type constraint and properly align output

1,934

edits

Revision as of 08:58, 24 September 2022 (view source) Grondilu (talk \| contribs) (→‎{{header\|Raku}}: wikipedia link to diacritics) ← Older edit		Revision as of 09:35, 24 September 2022 (view source) Grondilu (talk \| contribs) (→‎{{header\|Raku}}: use % operator, add type constraint and properly align output) Newer edit →
Line 3,978: =={{header\|Raku}}== (formerly Perl 6) {{works with\|Rakudo\|~~2020~~2022.~~08.1~~07}} Note: much of the following exposition is no longer critical to the task as the requirements have been updated, but is left here for historical and informational reasons. Line 3,991: Here is a sample that shows the result when using various different matchers. <syntaxhighlight lang="raku" line>sub MAIN ($filename, UInt $top = 10) { my $file = $filename.IO.slurp.lc.subst(/ (<[\w]-[_]>'-')\n(<[\w]-[_]>) /, {$0 ~ $1}, :g ); my @matcher = ( rx/ <[a..z]>+ /, # simple 7-bit ASCII rx/ \w+ /, # word characters with underscore rx/ <[\w]-[_]>+ /, # word characters without underscore rx/ [<[\w]-[_]>+~~[["'"\|~~]+ % < ' -~~'\|"~~ '-~~"]<[\w]-[_]~~ >+]* / # word characters without underscore but with hyphens and contractions ); for @matcher -> $reg { say "\nTop $top using regex: ", $reg.raku; my @words ~~.put for~~= $file.comb( $reg ).Bag.sort(-*.value)[^$top]; my $length = max @words».key».chars; printf "%-{$length}s %d\n", .key, .value for @words; } }</syntaxhighlight>