關(guān)于String內(nèi)的indexOf方法的一些疑問

sunnyxd 發(fā)布于2019-08-16 14:00 / 3119人閱讀

摘要：今天瀏覽了一下里的類，發(fā)現(xiàn)一個(gè)靜態(tài)方法有點(diǎn)意思，就是我們常用的的底層實(shí)現(xiàn)，先看下代碼調(diào)用鏈。所以字符串的長度是可以不用匹配的，故是沒問題的。關(guān)鍵的地方是這里加上了，是字符串的起始匹配偏移量，即從的哪個(gè)字符開始匹配。

今天瀏覽了一下java里的String類，發(fā)現(xiàn)一個(gè)靜態(tài)方法有點(diǎn)意思，就是我們常用的indexOf(String str)的底層實(shí)現(xiàn)，先看下代碼調(diào)用鏈。

public int indexOf(String str) {
    return indexOf(str, 0);
}
    
public int indexOf(String str, int fromIndex) {
    return indexOf(value, 0, value.length,
            str.value, 0, str.value.length, fromIndex);
}

static int indexOf(char[] source, int sourceOffset, int sourceCount,
        String target, int fromIndex) {
    return indexOf(source, sourceOffset, sourceCount,
                   target.value, 0, target.value.length,
                   fromIndex);
}

/**
 * Code shared by String and StringBuffer to do searches. The
 * source is the character array being searched, and the target
 * is the string being searched for.
 *
 * @param   source       the characters being searched.
 * @param   sourceOffset offset of the source string.
 * @param   sourceCount  count of the source string.
 * @param   target       the characters being searched for.
 * @param   targetOffset offset of the target string.
 * @param   targetCount  count of the target string.
 * @param   fromIndex    the index to begin searching from.
 */
static int indexOf(char[] source, int sourceOffset, int sourceCount,
        char[] target, int targetOffset, int targetCount,
        int fromIndex) {
    if (fromIndex >= sourceCount) {
        return (targetCount == 0 ? sourceCount : -1);
    }
    if (fromIndex < 0) {
        fromIndex = 0;
    }
    if (targetCount == 0) {
        return fromIndex;
    }

    char first = target[targetOffset];
    int max = sourceOffset + (sourceCount - targetCount);

    for (int i = sourceOffset + fromIndex; i <= max; i++) {
        /* Look for first character. */
        if (source[i] != first) {
            while (++i <= max && source[i] != first);
        }

        /* Found first character, now look at the rest of v2 */
        if (i <= max) {
            int j = i + 1;
            int end = j + targetCount - 1;
            for (int k = targetOffset + 1; j < end && source[j]
                    == target[k]; j++, k++);

            if (j == end) {
                /* Found whole string. */
                return i - sourceOffset;
            }
        }
    }
    return -1;
}

底層的字符串匹配的邏輯比較簡單，就是普通的匹配模式：

查找首字符，匹配target的第一個(gè)字符在source內(nèi)的位置，若查找到max位置還找到，則返回-1；

若在source匹配到了target的第一個(gè)字符，那么在依次比較srouce和target后面的字符，一直到target的末尾；

如果target后面的字符與source都已經(jīng)匹配，則返回在source上匹配到的第一個(gè)字符的相對下標(biāo)，否則返回-1。

但是仔細(xì)讀代碼會(huì)發(fā)現(xiàn)一個(gè)問題，就是這里

int max = sourceOffset + (sourceCount - targetCount);

max的計(jì)算方式，max的作用是計(jì)算出最大的首字符匹配次數(shù)，取值范圍應(yīng)該是"max <= sourceCount"。
所以target字符串的長度是可以不用匹配的，故“sourceCount - targetCount”是沒問題的。
關(guān)鍵的地方是這里加上了sourceOffset，sourceOffset是source字符串的起始匹配偏移量，即從source的哪個(gè)字符開始匹配。
所以，根據(jù)代碼里的max計(jì)算方式，最終計(jì)算出來的max值是會(huì)有可能大于sourceCount。
看下測試代碼：

package string;

/**
 * string test
 */
public class StringTest {

    static int indexOf(char[] source, int sourceOffset, int sourceCount,
                       char[] target, int targetOffset, int targetCount,
                       int fromIndex) {
        if (fromIndex >= sourceCount) {
            return (targetCount == 0 ? sourceCount : -1);
        }
        if (fromIndex < 0) {
            fromIndex = 0;
        }
        if (targetCount == 0) {
            return fromIndex;
        }

        char first = target[targetOffset];
        int max = sourceOffset + (sourceCount - targetCount);

        for (int i = sourceOffset + fromIndex; i <= max; i++) {
            /* Look for first character. */
            if (source[i] != first) {
                while (++i <= max && source[i] != first);
            }

            /* Found first character, now look at the rest of v2 */
            if (i <= max) {
                int j = i + 1;
                int end = j + targetCount - 1;
                for (int k = targetOffset + 1; j < end && source[j]
                        == target[k]; j++, k++);

                if (j == end) {
                    /* Found whole string. */
                    return i - sourceOffset;
                }
            }
        }
        return -1;
    }

    public static void main(String[] args) {
        String source = "abcdefghigklmn";
        String target = "n";
        int sourceOffset = 5;
        int targetOffset = 0;

        int index = indexOf(source.toCharArray(), sourceOffset, source.length(), target.toCharArray(), targetOffset, target.length(), 0);
        System.out.println(index);
    }
}

如果target在source內(nèi)可以匹配到返回正確結(jié)果8（結(jié)果8是相對于sourceOffset的結(jié)果，如果轉(zhuǎn)換成source內(nèi)的位置則是13）。
但是如果target在source內(nèi)匹配不到，則會(huì)拋出java.lang.ArrayIndexOutOfBoundsException異常，如下：

Exception in thread "main" java.lang.ArrayIndexOutOfBoundsException: 14
    at string.StringTest.indexOf(StringTest.java:27)
    at string.StringTest.main(StringTest.java:52)

可見報(bào)出越界的下標(biāo)是14,這就是由于max = sourceOffset + (sourceCount - targetCount)引起，計(jì)算出的max值為：17。

所以，個(gè)人認(rèn)為max計(jì)算這里是個(gè)潛在的BUG，應(yīng)該改為 int max = sourceCount - targetCount;

不過這個(gè)方法是一個(gè)非public方法，只在String內(nèi)部調(diào)用，同時(shí)也跟蹤了所有對該方法的調(diào)用鏈，都是傳入的默認(rèn)0,在使用時(shí)不會(huì)出現(xiàn)數(shù)組越界問題。
不知這是開發(fā)者故意為之，還是其它我未知用意，歡迎大家交流討論?。?！

GPU云服務(wù)器云服務(wù)器數(shù)據(jù)分析的一些方法 js匿名函數(shù)內(nèi)的方法 ecs服務(wù)器一個(gè)時(shí)段內(nèi)的流量查詢方法 indexof的用法

文章版權(quán)歸作者所有，未經(jīng)允許請勿轉(zhuǎn)載,若此文章存在違規(guī)行為，您可以聯(lián)系管理員刪除。

轉(zhuǎn)載請注明本文地址：http://systransis.cn/yun/72701.html

發(fā)表評論

登陸后可評論

0條評論

sunnyxd

男|高級講師

我要關(guān)注我要私信

TA的文章

文本溢出顯示省略號

閱讀 3521·2019-08-30 15:53
曾經(jīng)面試踩過的坑，都在這里了～

閱讀 3435·2019-08-29 16:54
自制簡單的range（Vue）

閱讀 2220·2019-08-29 16:41
uni-app項(xiàng)目展示屏幕文字滾動(dòng)效果

閱讀 2448·2019-08-23 16:10
淺談JavaScript代碼預(yù)解析 + 示例詳解

閱讀 3402·2019-08-23 15:04
使用 Solid 私有化存儲(chǔ) IPFS 文件哈希值

閱讀 1376·2019-08-23 13:58
JavaScript六種非常經(jīng)典的對象繼承方式

閱讀 376·2019-08-23 11:40
HTML5 新特性

閱讀 2480·2019-08-23 10:26

成人国产在线小视频_日韩寡妇人妻调教在线播放_色成人www永久在线观看_2018国产精品久久_亚洲欧美高清在线30p_亚洲少妇综合一区_黄色在线播放国产_亚洲另类技巧小说校园_国产主播xx日韩_a级毛片在线免费

資訊專欄INFORMATION COLUMN

上云采購季！| 2核2G4M爆款云服務(wù)器低至59元/年，更有多臺(tái)、長期優(yōu)惠，快來選購！

關(guān)于String內(nèi)的indexOf方法的一些疑問

相關(guān)文章

javascript 數(shù)組去重的6種思路

也談JavaScript數(shù)組去重

Javascripts數(shù)組原生方法集合

**JS數(shù)組中的indexOf方法**

字符串的四則運(yùn)算表達(dá)式

發(fā)表評論

0條評論

sunnyxd

男|高級講師

TA的文章

文本溢出顯示省略號

曾經(jīng)面試踩過的坑，都在這里了～

自制簡單的range（Vue）

uni-app項(xiàng)目展示屏幕文字滾動(dòng)效果

淺談JavaScript代碼預(yù)解析 + 示例詳解

使用 Solid 私有化存儲(chǔ) IPFS 文件哈希值

JavaScript六種非常經(jīng)典的對象繼承方式

HTML5 新特性

最新活動(dòng)

資訊專欄INFORMATION COLUMN

上云采購季！| 2核2G4M爆款云服務(wù)器低至59元/年，更有多臺(tái)、長期優(yōu)惠，快來選購！

關(guān)于String內(nèi)的indexOf方法的一些疑問

相關(guān)文章

發(fā)表評論

0條評論

男|高級講師

TA的文章

最新活動(dòng)

上云采購季！| 2核2G4M爆款云服務(wù)器低至59元/年，更有多臺(tái)、長期優(yōu)惠，快來選購！