Fix linears with permute_copy

mcr229 · facebook-github-bot · commit 5ca981f582b3 · 2023-08-18T17:31:23.000-07:00
Summary:
There are issues with permute_copy in both the partitioner and the convert_to_linear pass.

### Partitioner:
For Quantized Partitions, we fail to pull in the q/dq nodes above permute_copy.

```
get_attr --> q --> dq --> permute_copy --> addmm
```
The fix is to check each input to the source partition: if an input is a permute_copy node, we add it to the partition and then check its own input for the dq node.
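
The walk described above can be sketched without torch. This is a minimal illustration, not the partitioner's actual code: `Node`, the string targets, and the `Q_OPS`/`DQ_OPS` sets are hypothetical stand-ins for `torch.fx.Node` and the real op lists, and the handling of the q and get_attr nodes above the dq is an assumption about what the rest of `get_input_deps` does.

```python
from dataclasses import dataclass
from typing import Iterable, Set, Tuple


@dataclass(eq=False)  # identity-based hashing, so nodes can live in a set
class Node:
    """Hypothetical stand-in for torch.fx.Node: a target name plus inputs."""
    target: str
    args: Tuple["Node", ...] = ()


# Assumed op names for illustration only; the real code compares edge op targets.
PERMUTE_COPY = "permute_copy"
DQ_OPS = {"dequantize"}
Q_OPS = {"quantize"}


def get_input_deps(input_nodes: Iterable[Node]) -> Set[Node]:
    """Collect the nodes above the partition inputs that belong with it."""
    nodes: Set[Node] = set()
    for inp in input_nodes:
        if inp.target == PERMUTE_COPY:
            # New behaviour: keep the permute_copy and look through it.
            nodes.add(inp)
            inp = inp.args[0]
        if inp.target in DQ_OPS:
            nodes.add(inp)
            # Assumption: a static weight is quantized from a get_attr node.
            q = inp.args[0]
            if q.target in Q_OPS and q.args and q.args[0].target == "get_attr":
                nodes.add(q)
                nodes.add(q.args[0])
    return nodes


# get_attr --> q --> dq --> permute_copy (input to the addmm partition)
weight = Node("get_attr")
q = Node("quantize", (weight,))
dq = Node("dequantize", (q,))
permute = Node(PERMUTE_COPY, (dq,))
deps = get_input_deps([permute])
```

Without the `permute_copy` check, the loop would see a node that is not a dq op and collect nothing, which matches the failure described above.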

### Convert to Linear Pass
In the pattern
```
get_attr --> q --> dq --> permute_copy --> addmm
```
We replace the entire source partition with a single linear node. However, we fail to delete the permute_copy because it, rather than the dq, is the input to the source partition (the dq should actually be the input to the linear source partition). The weight given to linear should be the input to permute_copy, not its result.

This happens because q and dq are not tagged as part of the linear source partition, so permute_copy becomes the input to the source partition.
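
The weight fix can be illustrated in isolation. The sketch below uses a hypothetical `Node` class and a hypothetical helper name, `resolve_linear_weight`, in place of the `torch.fx.Node` handling inside `create_linear`. The reasoning it encodes: addmm consumes the transposed weight, whereas linear applies the transpose itself, so linear should receive the node that feeds permute_copy.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(eq=False)
class Node:
    """Hypothetical stand-in for torch.fx.Node: a target name plus inputs."""
    target: str
    args: Tuple["Node", ...] = ()


def resolve_linear_weight(candidate: Node) -> Node:
    """Return the node that should be passed to linear as its weight.

    If the candidate found among the partition inputs is a permute_copy,
    skip it and use its input instead; otherwise use the candidate as-is.
    """
    if candidate.target == "permute_copy":  # assumed op name for illustration
        return candidate.args[0]
    return candidate


# dq --> permute_copy --> addmm: the partition input is the permute_copy.
dq = Node("dequantize")
permute = Node("permute_copy", (dq,))
linear_weight = resolve_linear_weight(permute)
```

Once linear takes `dq` directly, the permute_copy node has no remaining users from this pattern.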

Differential Revision: D48488931

fbshipit-source-id: a650a334cca2ce2e9da8f04805b519a19cbf1011
diff --git a/backends/xnnpack/partition/xnnpack_partitioner.py b/backends/xnnpack/partition/xnnpack_partitioner.py
@@ -682,6 +682,9 @@ def get_input_deps(  # noqa
         """
         nodes = set()
         for inp in input_nodes:
+            if inp.target == exir_ops.edge.aten.permute_copy.default:
+                nodes.add(inp)
+                inp = cast(torch.fx.Node, inp.args[0])
             if inp.target in self._DQ_OPS:
                 # dequant node
                 nodes.add(inp)
diff --git a/backends/xnnpack/passes/convert_to_linear.py b/backends/xnnpack/passes/convert_to_linear.py
@@ -120,6 +120,8 @@ def create_linear(
             src_partition.input_nodes
             + src_partition.params,  # non quant weight can be in params
         )
+        if linear_weight.target == exir_ops.edge.aten.permute_copy.default:
+            linear_weight = linear_weight.args[0]
         logger.debug(f"Found weight: {linear_weight} from node {node}")
 
         linear_bias = self.find(
