SVM (Non-Linearly Separable Data)

When the data is not linearly separable, Support Vector Machines (SVMs) use the "kernel trick" to implicitly map the data into a higher-dimensional space where it becomes linearly separable. This mapping is expressed through a kernel function.

Kernel Function:

A kernel function computes the dot product of two vectors in a higher-dimensional space without explicitly transforming the vectors to that space.

Common kernel functions include:

Linear Kernel:

K(x,y)=x⋅y

Polynomial Kernel:

K(x,y)=(x⋅y+c)^d

Radial Basis Function (RBF) Kernel:

K(x,y)=exp(−γ∥x−y∥^2)
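To make the kernel trick concrete, here is a minimal sketch (not from the original notebook) that checks the polynomial kernel with c = 1 and d = 2 against an explicit feature map; the helper name phi and the test vectors are illustrative choices.

import numpy as np

def phi(v):
    # Explicit feature map whose dot product equals (x . y + 1)^2
    v1, v2 = v
    return np.array([v1**2, v2**2,
                     np.sqrt(2) * v1 * v2,
                     np.sqrt(2) * v1,
                     np.sqrt(2) * v2,
                     1.0])

a = np.array([1.0, 2.0])
b = np.array([3.0, 4.0])

kernel_value = (a @ b + 1) ** 2      # computed in the original 2-D space
explicit_value = phi(a) @ phi(b)     # dot product after the explicit transform
print(kernel_value, explicit_value)  # both equal 144.0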

Non-Linearly Separable Dataset

import numpy as np
import matplotlib.pyplot as plt

# Outer ring (class 0): points on arcs of a circle of radius 10
x = np.linspace(-5.0, 5.0, 100)
y = np.sqrt(10**2 - x**2)
y = np.hstack([y, -y])
x = np.hstack([x, -x])

# Inner ring (class 1): points on a circle of radius 5
x1 = np.linspace(-5.0, 5.0, 100)
y1 = np.sqrt(5**2 - x1**2)
y1 = np.hstack([y1, -y1])
x1 = np.hstack([x1, -x1])

plt.scatter(y, x)
plt.scatter(y1, x1)
[Scatter plot: two concentric rings of points, one of radius 10 and one of radius 5]
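As an aside (not part of the original notebook), scikit-learn can generate a similar two-ring dataset in one call; the factor and noise values below are illustrative choices.

from sklearn.datasets import make_circles

# Two concentric rings: class 0 on the outer circle, class 1 on the inner circle
X_demo, y_demo = make_circles(n_samples=400, factor=0.5, noise=0.05, random_state=0)
plt.scatter(X_demo[:, 0], X_demo[:, 1], c=y_demo)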

Adding Labels to the Dataframe


import pandas as pd

# Outer ring -> class 0, inner ring -> class 1
df1 = pd.DataFrame(np.vstack([y, x]).T, columns=['X1', 'X2'])
df1['Y'] = 0
df2 = pd.DataFrame(np.vstack([y1, x1]).T, columns=['X1', 'X2'])
df2['Y'] = 1
df = pd.concat([df1, df2], ignore_index=True)
df.head(5)
X1 X2 Y
0 8.660254 -5.00000 0
1 8.717792 -4.89899 0
2 8.773790 -4.79798 0
3 8.828277 -4.69697 0
4 8.881281 -4.59596 0

Independent and Dependent features


X = df.iloc[:, :2]
y = df.Y
y
0      0
1      0
2      0
3      0
4      0
      ..
395    1
396    1
397    1
398    1
399    1
Name: Y, Length: 400, dtype: int64

Split the dataset into train and test

from sklearn.model_selection import train_test_split
X_train,X_test,y_train,y_test=train_test_split(X,y,test_size=0.25,random_state=0)
y_train
250    1
63     0
312    1
159    0
283    1
      ..
323    1
192    0
117    0
47     0
172    0
Name: Y, Length: 300, dtype: int64
from sklearn.svm import SVC

# Fit an SVM with the RBF kernel directly on the raw (X1, X2) features
classifier = SVC(kernel="rbf")
classifier.fit(X_train, y_train)
SVC()

from sklearn.metrics import accuracy_score
y_pred = classifier.predict(X_test)
accuracy_score(y_test, y_pred)
1.0
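To visualise what the RBF classifier has learned, here is a minimal sketch (not in the original notebook) that evaluates it over a grid and draws the decision regions; the grid range and resolution are illustrative choices.

# Predict the class over a grid covering both rings
xx, yy = np.meshgrid(np.linspace(-12, 12, 300), np.linspace(-12, 12, 300))
grid = pd.DataFrame(np.c_[xx.ravel(), yy.ravel()], columns=['X1', 'X2'])
zz = classifier.predict(grid).reshape(xx.shape)

plt.contourf(xx, yy, zz, alpha=0.3)               # predicted regions for class 0 and class 1
plt.scatter(df['X1'], df['X2'], c=df['Y'], s=10)  # original data on top
plt.show()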

df.head()

Polynomial Kernel


We need to construct the feature components used by the polynomial kernel:

X1, X2, X1_Square, X2_Square, X1*X2
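To see where these components come from, set c = 1 and d = 2 in the polynomial kernel (an illustrative choice) and expand it for two-dimensional points x = (x1, x2) and y = (y1, y2):

K(x,y) = (x⋅y + 1)^2 = (x1y1 + x2y2 + 1)^2
       = x1^2·y1^2 + x2^2·y2^2 + 2·x1y1·x2y2 + 2·x1y1 + 2·x2y2 + 1

This is exactly the dot product of (x1^2, x2^2, √2·x1x2, √2·x1, √2·x2, 1) with the same expression in y, so (up to constant factors) the kernel implicitly works with the components X1, X2, X1_Square, X2_Square and X1*X2 that we now build by hand.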


# Explicit degree-2 polynomial features
df['X1_Square'] = df['X1']**2
df['X2_Square'] = df['X2']**2
df['X1*X2'] = df['X1'] * df['X2']
df.head()
X1 X2 Y X1_Square X2_Square X1*X2
0 8.660254 -5.00000 0 75.000000 25.000000 -43.301270
1 8.717792 -4.89899 0 75.999898 24.000102 -42.708375
2 8.773790 -4.79798 0 76.979390 23.020610 -42.096467
3 8.828277 -4.69697 0 77.938476 22.061524 -41.466150
4 8.881281 -4.59596 0 78.877155 21.122845 -40.818009

### Independent and Dependent features
X = df[['X1','X2','X1_Square','X2_Square','X1*X2']]
y = df['Y']
X_train, X_test, y_train, y_test = train_test_split(X, y,
                                                    test_size = 0.25,
                                                    random_state = 0)

X_train
X1 X2 X1_Square X2_Square X1*X2
250 4.999745 0.050505 24.997449 0.002551 0.252512
63 9.906589 1.363636 98.140496 1.859504 13.508984
312 -3.263736 3.787879 10.651974 14.348026 -12.362637
159 -9.953852 -0.959596 99.079176 0.920824 9.551676
283 3.680983 3.383838 13.549638 11.450362 12.455852
... ... ... ... ... ...
323 -4.223140 2.676768 17.834915 7.165085 -11.304366
192 -9.031653 -4.292929 81.570758 18.429242 38.772248
117 -9.445795 3.282828 89.223038 10.776962 -31.008922
47 9.996811 -0.252525 99.936231 0.063769 -2.524447
172 -9.738311 -2.272727 94.834711 5.165289 22.132526

300 rows × 5 columns

import plotly.express as px

# 3D view in the (X1, X2, X1*X2) space
fig = px.scatter_3d(df, x='X1', y='X2', z='X1*X2',
                    color='Y')
fig.show()

# 3D view in the squared-feature space, where the two rings become linearly separable
fig = px.scatter_3d(df, x='X1_Square', y='X2_Square', z='X1*X2',
                    color='Y')
fig.show()
classifier = SVC(kernel="linear")
classifier.fit(X_train, y_train)
y_pred = classifier.predict(X_test)
accuracy_score(y_test, y_pred)
1.0
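For comparison, here is a minimal sketch (not part of the original notebook) that lets SVC apply the polynomial kernel to the raw X1, X2 features, so the expansion above happens implicitly via the kernel trick; degree=2 and coef0=1 are illustrative choices.

# Kernel trick: no hand-built squared or interaction features needed
X_raw = df[['X1', 'X2']]
X_train_raw, X_test_raw, y_train_raw, y_test_raw = train_test_split(
    X_raw, y, test_size=0.25, random_state=0)

poly_clf = SVC(kernel="poly", degree=2, coef0=1)   # K(x, y) = (gamma * x.y + coef0)^degree
poly_clf.fit(X_train_raw, y_train_raw)
accuracy_score(y_test_raw, poly_clf.predict(X_test_raw))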