InvalidArgumentError：重塑的输入是具有 178802 个值的张量，但请求的形状具有 89401 个值

Question

InvalidArgumentError：重塑的输入是具有 178802 个值的张量，但请求的形状具有 89401 个值

Moo*_*dra 5 python python-3.x deep-learning tensorflow

我遇到了另一个无效参数错误，但不太确定这次的原因是什么。

我创建了一个 TFRecord，其中包含形状为 [299,299] 的图像（据我所知是混合扩展）。

我正在尝试批量加载图像，但遇到此错误：

'InvalidArgumentError (see above for traceback): Input to reshape is a tensor with 178802 values, but the requested shape has 89401
     [[Node: Reshape = Reshape[T=DT_FLOAT, Tshape=DT_INT32, _device="/job:localhost/replica:0/task:0/cpu:0"](DecodeRaw, Reshape/shape)]]

Run Code Online (Sandbox Code Playgroud)

这是我的代码：

import tensorflow as tf
import numpy as np
import matplotlib.pyplot as plt
import os

IMAGE_DIR =r'C:\Users\Moondra\Desktop\TF_FISH_PROJECT\FINAL_FISHES'

data_path = r'E:\TFRECORDS\normal_fish_conversion_2.tfrecords'  

with tf.Session() as sess:
    feature = {'train/image': tf.FixedLenFeature([], tf.string),
               'train/label': tf.FixedLenFeature([], tf.int64),
               'rows':  tf.FixedLenFeature([], tf.int64),
                'columns':  tf.FixedLenFeature([], tf.int64)}

    # Create a list of filenames and pass it to a queue
    filename_queue = tf.train.string_input_producer([data_path], num_epochs=1000)

    # Define a reader and read the next record
    reader = tf.TFRecordReader()
    _, serialized_example = reader.read(filename_queue)

    # Decode the record read by the reader
    features = tf.parse_single_example(serialized_example, features=feature)

    # Convert the image data from string back to the numbers
    image = tf.decode_raw(features['train/image'], tf.float32)

    # Cast label data into int32
    label = tf.cast(features['train/label'], tf.int32)

    # Reshape image data into the original shape
    image = tf.reshape(image, [299, 299])
    print(image.shape) #shape is printing out correctly


    # Creates batches by randomly shuffling tensors
    #images, labels = tf.train.shuffle_batch([image, label], batch_size=50, capacity=10000, num_threads=3, min_after_dequeue=2000)
    init_op = tf.group(tf.global_variables_initializer(), tf.local_variables_initializer())
    sess.run(init_op)
    coord = tf.train.Coordinator()
    threads = tf.train.start_queue_runners(coord=coord)

    for batch_index in range(5):
            img  = sess.run([image])
            img = img.astype(np.uint8)
            print(img.shape)





    coord.request_stop()
    coord.join(threads)
    sess.close()

Run Code Online (Sandbox Code Playgroud)

我不太确定如何调试这个..

第一个打印语句（reshape_image.shape）打印出（299,299）形状，所以不确定问题是什么。

谢谢。

Answer 1

tsv*_*iko 3

我需要做的是将图像解码为 JPEG，将其转换为浮点数，扩展其尺寸，然后使用双线性插值调整其大小，如下所示：

image = tf.image.decode_jpeg(features['train/image'], channels=3)
image = tf.image.convert_image_dtype(image, dtype=tf.float32)
image = tf.expand_dims(image, 0)
image = tf.image.resize_bilinear(image, [299, 299], align_corners=False)

Run Code Online (Sandbox Code Playgroud)

笔记：

您的图像应该已经以 JPEG 格式存储（创建 TFRecord 时）。
如果图像是灰度图像，则可以将其设置channels为 1，或者将每个图像的通道数保存在 TFRecords 中并从那里动态获取（每个图像都不同）。

归档时间：	8 年，3 月前
查看次数：	4853 次
最近记录：	7 年，8 月前