开发者

how to read this text from file

how to read the text below?

‰€ˆ‡‰�#îõ‘þüŠ ꑯõù ‚†ƒ� -#�ª÷‘þü “‘ ª“îù )øþ¦ùý ¤ª—ùý î‘õ•þø—¤(#•¢þ¢� ø¤÷¢ù ꑯõù îõ‘þü#^a—ú¤�ö^b•¦øû÷¢ð‘ö ¤�ù ¢�÷©^cˆˆƒ�#‚€� «.: õ¬ø¤Š ›¢øñ#…�…ˆí/Š…/ …€�…Š}TK{^aˆˆƒ�#†/„€€#}BF{#ª“îùû‘ý î‘õ•þø—¤ý#ª“îùû‘ý î‘õ•þø—¤ý -- �¥õøöû‘#^c

I use this code but not display all characters

FileStream fs = new FileStream(open.FileName, FileMode.Open, FileAccess.Read);
        System.Text.Encoding enc = System.Text.Encoding.UTF8 ;
        byte[] data = new byte[fs.Length];
        fs.Read(data, 0, data.Length);
        string text = enc.GetString(data);

and show text :

†‰€ˆ‡‰Â�#îõâ� �˜Ã¾Ã¼Å ꑯõù ‚†ƒÂ� -#Â�ª÷‘þü “‘ ª“îù )øþ¦ùý ¤ª—ùý î‘õ•þø—¤(#â€� �¢þ¢Â� ø¤÷¢ù ꑯõù îõ‘þü#^a—ú¤Â�ö ^b•¦øû÷¢ð‘ö ¤Â�ù ¢Â�֩^cˆˆƒÂ�#‚₠¬Â� «.: õ¬ø¤Š›¢øñ#…Â�…ˆí/Š…/ …€Â�…Š}TK{^aˆˆƒÂ�#†/„€€#}BF{#ª“îùû ‘ý î‘õ•þø—¤ý#Â� �“îùû‘ý î‘õ

this is a TEXT DOS and encoding this text is:

IBM037

IBM437

IBM500

ASMO-708

DOS-720

ibm737

ibm775

ibm850

ibm852

IBM855

ibm857

IBM00858

IBM860

ibm861

DOS-862

IBM863

IBM864

IBM865

cp866

ibm869

IBM870

windows-874

cp875

shift_jis

gb2312

ks_c_5601-1987

big5

IBM1026

IBM01047

IBM01140

IBM01141

IBM01142

IBM01143

IBM01144


IBM01145

IBM01146

IBM01147

IBM01148

IBM01149

utf-16

unicodeFFFE

windows-1250

windows-1251

Windows-1252

windows-1253

windows-1254

windows-1255

windows-1256

windows-1257

windows-1258

Johab

macintosh

x-mac-japanese

x-mac-chinesetrad

x-mac-korean

x-mac-arabic

x-mac-hebrew

x-mac-greek

x-mac-cyrillic



x-mac-chinesesimp

x-mac-romanian

x-mac-ukrainian

x-mac-thai

x-mac-ce

x-mac-icelandic

x-mac-turkish

x-mac-croatian

utf-32

utf-32BE

x-Chinese-CNS

x-cp20001

x-Chinese-Eten

x-cp20003

x-cp20004

x-cp20005

x-IA5

x-IA5-German

x-IA5-Swedish

x-IA5-Norwegian

us-ascii

x-cp20261

x-cp20269

IBM273

IBM277

IBM278

IBM280

IBM284

IBM285

IBM290

IBM297

IBM420

IBM423

IBM424

x-EBCDIC-KoreanExtended

IBM-Thai

koi8-r

IBM871

IBM880

IBM905

IBM00924

EUC-JP

x-cp20936

x-cp20949

cp1025

koi8-u

iso-8859-1

iso-8859-2

iso-8859-3

iso-8859-4

iso-8859-5

is开发者_如何学JAVAo-8859-6

iso-8859-7

iso-8859-8

iso-8859-9

iso-8859-13

iso-8859-15

x-Europa

iso-8859-8-i

iso-2022-jp

csISO2022JP

iso-2022-jp

iso-2022-kr

x-cp50227

euc-jp

EUC-CN

euc-kr

hz-gb-2312

GB18030

x-iscii-de

x-iscii-be

x-iscii-ta

x-iscii-te

x-iscii-as

x-iscii-or

x-iscii-ka

x-iscii-ma

x-iscii-gu

x-iscii-pa

utf-7

utf-8


To read the file you need to know what encoding used in this file.

If you don't know, you can iterate through all encodings and see if find the one that works.

const string FileName = "FileName";
foreach (var encodingInfo in Encoding.GetEncodings())
{
    try
    {
        var encoding = encodingInfo.GetEncoding();
        var text = File.ReadAllText(FileName, encoding);
        Console.WriteLine("{0} - {1}", encodingInfo.Name, text.Substring(0, 20));

        // put break point and check if text is readable here
    }
    catch (Exception ex)
    {
        Console.WriteLine("Failed: {0}", encodingInfo.Name);
    }
}

Disclaimer: assuming this is a text file, assuming the file isn't huge.


Well it looks like you're trying to open a .dat file, which is probably written with a byte format by the looks of it

Try the following code

        File readThis = new File("file directory");
        byte[] aByte = new byte[(int)readThis.length()];
        FileInputStream Fis = new FileInputStream(readThis);
        Fis.read(aByte);
        System.out.println(Contents: "+aByte);
        Fis.close();

Let me know how it goes :)

0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜